Code Stars

voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Language: Python
Total stars: 2612
Stars trend:
7 May 2023

 7pm ▎ +2

 8pm  +0

 9pm ▎ +2

10pm ▉ +7

11pm █▍ +11

8 May 2023

12am █▉ +15

 1am ▉ +7

 2am █▍ +11

 3am █▋ +13

 4am █▋ +13

 5am █████▌ +44

 6am █████▍ +43

#python
#contentvec, #deeplearning, #gan, #hubert, #lightning, #pytorch, #pytorchlightning, #realtime, #sovitssvc, #softvc, #sovits, #speechsynthesis, #vits, #voicechanger, #voiceconversion

32 views07:17

Code Stars

coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python
Total stars: 15436
Stars trend:
17 Sep 2023

 3am ▍ +3

 4am ▎ +2

 5am ▍ +3

 6am ▏ +1

 7am ▌ +4

 8am ▎ +2

 9am ▌ +4

10am ▎ +2

11am ███▋ +29

12pm █████▍ +43

 1pm ██████▍ +51

 2pm ███████ +56

#python
#deeplearning, #glowtts, #hifigan, #melgan, #multispeakertts, #python, #pytorch, #speakerencoder, #speakerencodings, #speech, #speechsynthesis, #tacotron, #texttospeech, #tts, #ttsmodel, #vocoder, #voicecloning, #voiceconversion, #voicesynthesis

❤2

93 views15:17

Code Stars

netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
Total stars: 240
Stars trend:
10 Nov 2023

 1am ▏ +1

 2am  +0

 3am ▏ +1

 4am ▎ +2

 5am █▎ +10

 6am ███████████ +88

 7am ██▌ +20

 8am █▎ +10

 9am ▋ +5

10am ▊ +6

11am ███▍ +27

12pm ████▌ +36

#python
#ai, #deeplearning, #emotion, #emotivoice, #multispeaker, #prompt, #python, #pytorch, #speech, #speechsynthesis, #style, #texttospeech, #tts

48 views13:17

Code Stars

yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python
Total stars: 568
Stars trend:

19 Nov 2023
 4pm ▏ +1
 5pm ▎ +2
 6pm ████████████▏ +97
 7pm ███████████████▍ +123

#python
#adversarialtraining, #deeplearning, #diffusionmodels, #gan, #latentdiffusion, #latentdiffusionmodels, #pytorch, #speakeradaptation, #speechsynthesis, #texttospeech, #tts, #wavlm

52 views20:17

Code Stars

open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python
Total stars: 903
Stars trend:

19 Dec 2023
 9am █▍ +11
10am ▋ +5
11am ▉ +7
12pm ██ +16
 1pm █▋ +13
 2pm █▋ +13
 3pm ██▍ +19
 4pm █▌ +12
 5pm ██ +16
 6pm █▍ +11
 7pm ██▎ +18
 8pm █▌ +12

#python
#audiogeneration, #audiosynthesis, #audioldm, #audit, #fastspeech2, #hifigan, #musicgeneration, #naturalspeech2, #singingvoiceconversion, #speechsynthesis, #texttoaudio, #texttospeech, #valle, #vits, #voiceconversion

183 views21:17

Code Stars

collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Language:Jupyter Notebook
Total stars: 571
Stars trend:

17 Jan 2024
 7pm ▏ +1
 8pm ▏ +1
 9pm  +0
10pm  +0
11pm  +0
18 Jan 2024
12am  +0
 1am  +0
 2am  +0
 3am █▋ +13
 4am ██████▍ +51
 5am █████▏ +41
 6am █████▌ +44

#jupyternotebook
#pytorch, #speechsynthesis, #tts

193 views07:19

Code Stars

espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Language:C
Total stars: 2997
Stars trend:

2 May 2024
12am ▏ +1
 1am  +0
 2am ██ +16
 3am █▌ +12
 4am ██▋ +21
 5am ███▍ +27

#c
#android, #espeak, #espeakng, #speechsynthesis, #texttospeech

👍1

224 views06:35

Code Stars

ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:

17 Jun 2024
 9pm ▏ +1
10pm ▏ +1
11pm ▎ +2
18 Jun 2024
12am  +0
 1am ▋ +5
 2am ▍ +3
 3am █▍ +11
 4am ███ +24
 5am █▋ +13
 6am █ +8
 7am ▉ +7

#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice

116 views08:17

Code Stars

DigitalPhonetics/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Language:Python
Total stars: 556
Stars trend:

19 Jun 2024
 8pm ▍ +3
 9pm ██▌ +20
10pm ██▊ +22
11pm ▊ +6
20 Jun 2024
12am █▋ +13
 1am █▋ +13

#python
#deeplearning, #pytorch, #speech, #speechprocessing, #speechsynthesis, #texttospeech, #toolkit, #tts

103 views02:17

Code Stars

Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Python
Total stars: 738
Stars trend:

24 Jun 2024
 7pm ▏ +1
 8pm ██▎ +18
 9pm ████▉ +39
10pm ████▌ +36

#python
#prosody, #speech, #speechsynthesis, #texttospeech, #voicecloneai, #voicecloning

105 views23:17

Code Stars

rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python
Total stars: 4321
Stars trend:

29 Jun 2024
12pm ▍ +3
 1pm █▉ +15
 2pm █▋ +13
 3pm █▌ +12
 4pm ██▏ +17
 5pm ▊ +6
 6pm ▍ +3
 7pm ▍ +3
 8pm ▏ +1
 9pm  +0
10pm ▌ +4

#python
#speechsynthesis, #texttospeech, #tts

119 views23:17

Code Stars

huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python
Total stars: 2735
Stars trend:

3 Sep 2024
 1am ▍ +3
 2am  +0
 3am  +0
 4am ▏ +1
 5am  +0
 6am ▌ +4
 7am █▏ +9
 8am ██▎ +18
 9am ██▍ +19
10am █▏ +9
11am ▊ +6
12pm █▍ +11

#python
#ai, #assistant, #languagemodel, #machinelearning, #python, #speech, #speechsynthesis, #speechtotext, #speechtranslation

👍1

127 views13:18

Code Stars

abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language:Python
Total stars: 385
Stars trend:

9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
 1am █▏ +9
 2am ██▏ +17
 3am █▎ +10
 4am ▉ +7
 5am ▊ +6
 6am ▍ +3
 7am ▌ +4
 8am ▌ +4
 9am █ +8

#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #webui, #whisper, #ytdlp

105 views10:17

Code Stars

abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:

21 Jan 2025
 6am ██▍ +19
 7am ▎ +2
 8am ▌ +4
 9am ▍ +3
10am  +0
11am ▋ +5
12pm ▌ +4
 1pm ▊ +6
 2pm █▋ +13
 3pm ▉ +7
 4pm ▉ +7
 5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp

👍1

112 views18:17

Code Stars

21 Jan 2025
10am ▍ +3
11am ▋ +5
12pm █▎ +10
 1pm ▌ +4
 2pm █ +8
 3pm ▊ +6
 4pm █▏ +9
 5pm █▋ +13
 6pm ▌ +4
 7pm ▋ +5
 8pm ▎ +2
 9pm █ +8

#python
#audiogeneration, #audiosynthesis, #audioldm, #audit, #emilia, #fastspeech2, #maskgct, #musicgeneration, #naturalspeech2, #singingvoiceconversion, #speechsynthesis, #texttoaudio, #texttospeech, #valle, #vits, #vocoder, #voiceconversion

105 views22:18

Code Stars

rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python
Total stars: 6978
Stars trend:

23 Jan 2025
 2am ▏ +1
 3am ▊ +6
 4am █▊ +14
 5am █▉ +15
 6am █▍ +11
 7am ██ +16
 8am █▏ +9
 9am █▎ +10

#python
#speechsynthesis, #texttospeech, #tts

85 views10:17

Code Stars

Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language:Python
Total stars: 1009
Stars trend:

8 May 2025
 5am ▍ +3
 6am ▍ +3
 7am ▏ +1
 8am ▍ +3
 9am  +0
10am ▋ +5
11am █▍ +11
12pm █▍ +11
 1pm ▍ +3
 2pm █▊ +14
 3pm █▊ +14
 4pm █ +8

#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers

1🔥1

105 views17:17

Code Stars

NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python
Total stars: 13977
Stars trend:

8 May 2025
11am ▉ +7
12pm █▉ +15
 1pm ▉ +7
 2pm █▏ +9
 3pm ▉ +7
 4pm ▉ +7
 5pm ▊ +6
 6pm ▋ +5
 7pm ▍ +3
 8pm ▌ +4
 9pm ▍ +3
10pm ▉ +7

#python
#asr, #deeplearning, #generativeai, #largelanguagemodels, #machinetranslation, #multimodal, #neuralnetworks, #speakerdiariazation, #speakerrecognition, #speechsynthesis, #speechtranslation, #tts

102 views23:19

Code Stars

abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:

27 Jun 2025
 8am ▍ +3
 9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
 1pm █▉ +15
 2pm █▊ +14
 3pm ▉ +7
 4pm █ +8
 5pm ▌ +4

#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp

124 views18:17

Code Stars

denizsafak/abogen
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Language:Python
Total stars: 1242
Stars trend:

10 Aug 2025
 6am ██▌ +20
 7am █████▋ +45
 8am ███████▍ +59
 9am █████▊ +46
10am ███▎ +26

#python
#audiobook, #audiobooks, #contentcreation, #contentcreator, #epubconverter, #kokoro, #kokoro82m, #kokorotts, #mediageneration, #narrator, #speechsynthesis, #subtitles, #texttoaudio, #texttospeech, #tts, #voicesynthesis

112 views11:17

About

Blog

Apps

Platform