Vol Building AGI – Telegram

580 subscribers

Past topics: speech synthesis, transformers, LSTM, recurrence

Vol Building AGI

Channel created

08:47

Vol Building AGI

Channel name was changed to «ICASSP 2022 vc and tts»

08:50

Vol Building AGI

Channel name was changed to «ICASSP 2022 Voice Conversion and Synthesis»

08:50

Vol Building AGI

https://github.com/biggytruck/SpeechSplit2

https://arxiv.org/abs/2203.14156

GitHub - biggytruck/SpeechSplit2: Official implementation of SpeechSplit2

34 views, 08:53

Vol Building AGI

How to augment speech content (probably usable as augmentations for speech recognition too)

35 views, 09:01

Vol Building AGI

signal processing revealed

34 views, 09:03

Vol Building AGI

VTLP (vocal tract length perturbation) code: https://github.com/biggytruck/SpeechSplit2/blob/0911c09732e0e935c7c0a7aaf23eb2923d9889d8/utils.py#L252-L276
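
For readers who don't want to open the repo, here is a minimal sketch of the general VTLP idea: a piecewise-linear warp of the frequency axis as described by Jaitly and Hinton (2013). It is not a copy of the linked utils.py. The function names, the default boundary frequency `f_hi=4800.0`, and the choice to apply the warp to a linear-frequency magnitude spectrogram by interpolation are all assumptions made for illustration.

```python
import numpy as np


def vtlp_warp_frequencies(freqs, alpha, sample_rate, f_hi=4800.0):
    """Piecewise-linear VTLP warp of a frequency axis (in Hz).

    alpha is the warp factor (typically sampled near 1.0, e.g. 0.9 to 1.1).
    f_hi is an assumed boundary frequency; the warp keeps 0 Hz and the
    Nyquist frequency fixed. Requires f_hi * min(alpha, 1) / alpha < Nyquist.
    """
    freqs = np.asarray(freqs, dtype=np.float64)
    nyquist = sample_rate / 2.0
    scale = f_hi * min(alpha, 1.0)
    boundary = scale / alpha  # below this point the warp is a pure scaling
    # Above the boundary, a second linear segment maps back onto Nyquist.
    upper = nyquist - (nyquist - scale) / (nyquist - boundary) * (nyquist - freqs)
    return np.where(freqs <= boundary, freqs * alpha, upper)


def vtlp_spectrogram(mag, alpha, sample_rate, f_hi=4800.0):
    """Apply the warp to a magnitude spectrogram of shape (n_bins, n_frames).

    Bins are assumed to be linearly spaced from 0 Hz to Nyquist.
    """
    n_bins = mag.shape[0]
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    warped = vtlp_warp_frequencies(freqs, alpha, sample_rate, f_hi)
    out = np.empty(mag.shape, dtype=np.float64)
    for t in range(mag.shape[1]):
        # Energy at source frequency f moves to warped(f); resample each
        # frame back onto the original linear grid.
        out[:, t] = np.interp(freqs, warped, mag[:, t])
    return out
```

With `alpha = 1.0` the warp is the identity, so the spectrogram comes back unchanged. For augmentation one would draw a fresh `alpha` per utterance.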

34 views, 09:09

Vol Building AGI

New SOTA on TTS from Microsoft Research Asia (outside of ICASSP)

Uses LJSpeech (24 hours, 13,100 utterances), plus 200M text sentences for pretraining the phoneme encoder, and a g2p model. Trained on 8 V100 GPUs for 3000 epochs.

https://speechresearch.github.io/naturalspeech/

33 views, 11:24

Vol Building AGI

In the meantime, all Interspeech 2021 videos have been made available: https://www.superlectures.com/interspeech2021/tutorials

https://www.youtube.com/channel/UC2-z0HD4WpSbJONj73BgfwQ/videos

33 views, edited 14:23

Vol Building AGI

https://www.youtube.com/watch?v=-p_awLZWLeI

https://github.com/facebookresearch/vocoder-benchmark

VocBench, a vocoder benchmark from Facebook

Autoregressive vocoders: WaveNet, WaveRNN
GANs: Parallel WaveGAN, MelGAN
Diffusion: WaveGrad, DiffWave

All in one place, behind a common input-output interface, in a modern codebase from Facebook.

Might be useful for VC if it’s easy to condition those vocoders on custom features.

36 views, edited 14:53