Vol Building AGI
Past topics: speech synthesis, transformers, LSTM, recurrence
Channel created
Channel name was changed to «ICASSP 2022 vc and tts»
Channel name was changed to «ICASSP 2022 Voice Conversion and Synthesis»
How to augment speech content (likely usable as recognition augmentations too)
signal processing revealed
New SOTA on TTS from Microsoft Research Asia (outside of ICASSP)

It uses 24 hours (13,100 utterances) of LJSpeech, 200M text sentences for phoneme encoder pretraining, and a g2p model. Training ran on 8 V100 GPUs for 3000 epochs.

https://speechresearch.github.io/naturalspeech/
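
For a rough sense of scale, the figures quoted in the post can be turned into a quick back-of-the-envelope calculation. This is only a sketch: it assumes every epoch passes over all 13,100 utterances (the post does not mention any held-out split), and the variable names are made up for illustration.

```python
# Figures quoted in the post.
hours = 24
utterances = 13_100
epochs = 3_000

# Average clip length in seconds.
avg_seconds = hours * 3600 / utterances

# Assumption: each epoch covers the full dataset, with no held-out split.
total_passes = utterances * epochs
audio_hours_seen = hours * epochs

print(f"average utterance length: {avg_seconds:.1f} s")
print(f"utterance presentations over training: {total_passes:,}")
print(f"audio hours processed over training: {audio_hours_seen:,}")
```

Under that assumption the model sees each short clip thousands of times, which is why the small dataset size and the high epoch count go together.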