Code Stars
1.88K subscribers
8.49K photos
8.78K links
Code Stars provides notifications about GitHub repositories that are gaining a significant number of stars in a short period of time. Be the first to find out about trending repositories that everybody will be talking about soon.
#AI #chatGPT #python
Download Telegram
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Language: Python
Total stars: 2612
Stars trend:
7 May 2023
 7pm ▎ +2

 8pm  +0

 9pm ▎ +2

10pm ▉ +7

11pm █▍ +11

8 May 2023
12am █▉ +15

 1am ▉ +7

 2am █▍ +11

 3am █▋ +13

 4am █▋ +13

 5am █████▌ +44

 6am █████▍ +43

#python
#contentvec, #deeplearning, #gan, #hubert, #lightning, #pytorch, #pytorchlightning, #realtime, #sovitssvc, #softvc, #sovits, #speechsynthesis, #vits, #voicechanger, #voiceconversion
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python
Total stars: 15436
Stars trend:
17 Sep 2023
 3am ▍ +3

 4am ▎ +2

 5am ▍ +3

 6am ▏ +1

 7am ▌ +4

 8am ▎ +2

 9am ▌ +4

10am ▎ +2

11am ███▋ +29

12pm █████▍ +43

 1pm ██████▍ +51

 2pm ███████ +56

#python
#deeplearning, #glowtts, #hifigan, #melgan, #multispeakertts, #python, #pytorch, #speakerencoder, #speakerencodings, #speech, #speechsynthesis, #tacotron, #texttospeech, #tts, #ttsmodel, #vocoder, #voicecloning, #voiceconversion, #voicesynthesis
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
Total stars: 240
Stars trend:
10 Nov 2023
 1am ▏ +1

 2am  +0

 3am ▏ +1

 4am ▎ +2

 5am █▎ +10

 6am ███████████ +88

 7am ██▌ +20

 8am █▎ +10

 9am ▋ +5

10am ▊ +6

11am ███▍ +27

12pm ████▌ +36

#python
#ai, #deeplearning, #emotion, #emotivoice, #multispeaker, #prompt, #python, #pytorch, #speech, #speechsynthesis, #style, #texttospeech, #tts
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python
Total stars: 568
Stars trend:
19 Nov 2023
4pm ▏ +1
5pm ▎ +2
6pm ████████████▏ +97
7pm ███████████████▍ +123

#python
#adversarialtraining, #deeplearning, #diffusionmodels, #gan, #latentdiffusion, #latentdiffusionmodels, #pytorch, #speakeradaptation, #speechsynthesis, #texttospeech, #tts, #wavlm
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python
Total stars: 903
Stars trend:
19 Dec 2023
9am █▍ +11
10am ▋ +5
11am ▉ +7
12pm ██ +16
1pm █▋ +13
2pm █▋ +13
3pm ██▍ +19
4pm █▌ +12
5pm ██ +16
6pm █▍ +11
7pm ██▎ +18
8pm █▌ +12

#python
#audiogeneration, #audiosynthesis, #audioldm, #audit, #fastspeech2, #hifigan, #musicgeneration, #naturalspeech2, #singingvoiceconversion, #speechsynthesis, #texttoaudio, #texttospeech, #valle, #vits, #voiceconversion
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Language:Jupyter Notebook
Total stars: 571
Stars trend:
17 Jan 2024
7pm ▏ +1
8pm ▏ +1
9pm +0
10pm +0
11pm +0
18 Jan 2024
12am +0
1am +0
2am +0
3am █▋ +13
4am ██████▍ +51
5am █████▏ +41
6am █████▌ +44

#jupyternotebook
#pytorch, #speechsynthesis, #tts
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Language:C
Total stars: 2997
Stars trend:
2 May 2024
12am ▏ +1
1am +0
2am ██ +16
3am █▌ +12
4am ██▋ +21
5am ███▍ +27

#c
#android, #espeak, #espeakng, #speechsynthesis, #texttospeech
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
17 Jun 2024
9pm ▏ +1
10pm ▏ +1
11pm ▎ +2
18 Jun 2024
12am +0
1am ▋ +5
2am ▍ +3
3am █▍ +11
4am ███ +24
5am █▋ +13
6am █ +8
7am ▉ +7

#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
DigitalPhonetics/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Language:Python
Total stars: 556
Stars trend:
19 Jun 2024
8pm ▍ +3
9pm ██▌ +20
10pm ██▊ +22
11pm ▊ +6
20 Jun 2024
12am █▋ +13
1am █▋ +13

#python
#deeplearning, #pytorch, #speech, #speechprocessing, #speechsynthesis, #texttospeech, #toolkit, #tts
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Python
Total stars: 738
Stars trend:
24 Jun 2024
7pm ▏ +1
8pm ██▎ +18
9pm ████▉ +39
10pm ████▌ +36

#python
#prosody, #speech, #speechsynthesis, #texttospeech, #voicecloneai, #voicecloning
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python
Total stars: 4321
Stars trend:
29 Jun 2024
12pm ▍ +3
1pm █▉ +15
2pm █▋ +13
3pm █▌ +12
4pm ██▏ +17
5pm ▊ +6
6pm ▍ +3
7pm ▍ +3
8pm ▏ +1
9pm +0
10pm ▌ +4

#python
#speechsynthesis, #texttospeech, #tts
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python
Total stars: 2735
Stars trend:
3 Sep 2024
1am ▍ +3
2am +0
3am +0
4am ▏ +1
5am +0
6am ▌ +4
7am █▏ +9
8am ██▎ +18
9am ██▍ +19
10am █▏ +9
11am ▊ +6
12pm █▍ +11

#python
#ai, #assistant, #languagemodel, #machinelearning, #python, #speech, #speechsynthesis, #speechtotext, #speechtranslation
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language:Python
Total stars: 385
Stars trend:
9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
1am █▏ +9
2am ██▏ +17
3am █▎ +10
4am ▉ +7
5am ▊ +6
6am ▍ +3
7am ▌ +4
8am ▌ +4
9am █ +8

#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #webui, #whisper, #ytdlp
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:
21 Jan 2025
6am ██▍ +19
7am ▎ +2
8am ▌ +4
9am ▍ +3
10am +0
11am ▋ +5
12pm ▌ +4
1pm ▊ +6
2pm █▋ +13
3pm ▉ +7
4pm ▉ +7
5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python
Total stars: 8221
Stars trend:
21 Jan 2025
10am ▍ +3
11am ▋ +5
12pm █▎ +10
1pm ▌ +4
2pm █ +8
3pm ▊ +6
4pm █▏ +9
5pm █▋ +13
6pm ▌ +4
7pm ▋ +5
8pm ▎ +2
9pm █ +8

#python
#audiogeneration, #audiosynthesis, #audioldm, #audit, #emilia, #fastspeech2, #maskgct, #musicgeneration, #naturalspeech2, #singingvoiceconversion, #speechsynthesis, #texttoaudio, #texttospeech, #valle, #vits, #vocoder, #voiceconversion
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python
Total stars: 6978
Stars trend:
23 Jan 2025
2am ▏ +1
3am ▊ +6
4am █▊ +14
5am █▉ +15
6am █▍ +11
7am ██ +16
8am █▏ +9
9am █▎ +10

#python
#speechsynthesis, #texttospeech, #tts
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language:Python
Total stars: 1009
Stars trend:
8 May 2025
5am ▍ +3
6am ▍ +3
7am ▏ +1
8am ▍ +3
9am +0
10am ▋ +5
11am █▍ +11
12pm █▍ +11
1pm ▍ +3
2pm █▊ +14
3pm █▊ +14
4pm █ +8

#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python
Total stars: 13977
Stars trend:
8 May 2025
11am ▉ +7
12pm █▉ +15
1pm ▉ +7
2pm █▏ +9
3pm ▉ +7
4pm ▉ +7
5pm ▊ +6
6pm ▋ +5
7pm ▍ +3
8pm ▌ +4
9pm ▍ +3
10pm ▉ +7

#python
#asr, #deeplearning, #generativeai, #largelanguagemodels, #machinetranslation, #multimodal, #neuralnetworks, #speakerdiariazation, #speakerrecognition, #speechsynthesis, #speechtranslation, #tts