netease-youdao/EmotiVoice
EmotiVoice ๐: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
Total stars: 240
Stars trend:
10 Nov 2023
#python
#ai, #deeplearning, #emotion, #emotivoice, #multispeaker, #prompt, #python, #pytorch, #speech, #speechsynthesis, #style, #texttospeech, #tts
EmotiVoice ๐: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
Total stars: 240
Stars trend:
10 Nov 2023
1am โ +1
2am +0
3am โ +1
4am โ +2
5am โโ +10
6am โโโโโโโโโโโ +88
7am โโโ +20
8am โโ +10
9am โ +5
10am โ +6
11am โโโโ +27
12pm โโโโโ +36
#python
#ai, #deeplearning, #emotion, #emotivoice, #multispeaker, #prompt, #python, #pytorch, #speech, #speechsynthesis, #style, #texttospeech, #tts
jianchang512/stt
Voice Recognition to Text Tool / ไธไธช็ฆป็บฟ่ฟ่ก็ๆฌๅฐ่ฏญ้ณ่ฏๅซ่ฝฌๆๅญๆๅก๏ผ่พๅบjsonใsrtๅญๅนๅธฆๆถ้ดๆณใ็บฏๆๅญๆ ผๅผ
Language:Python
Total stars: 233
Stars trend:
#python
#speech, #speechrecognition, #speechtotext, #stt
Voice Recognition to Text Tool / ไธไธช็ฆป็บฟ่ฟ่ก็ๆฌๅฐ่ฏญ้ณ่ฏๅซ่ฝฌๆๅญๆๅก๏ผ่พๅบjsonใsrtๅญๅนๅธฆๆถ้ดๆณใ็บฏๆๅญๆ ผๅผ
Language:Python
Total stars: 233
Stars trend:
5 Jan 2024
12am โ +6
1am โโโโโ +36
2am โโโโ +28
3am โโโ +19
4am โ +8
5am โโ +13
6am โโโ +17
7am โโ +12
8am โ +5
9am โ +8
#python
#speech, #speechrecognition, #speechtotext, #stt
ictnlp/StreamSpeech
StreamSpeech is an โAll in Oneโ seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
StreamSpeech is an โAll in Oneโ seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
17 Jun 2024
9pm โ +1
10pm โ +1
11pm โ +2
18 Jun 2024
12am +0
1am โ +5
2am โ +3
3am โโ +11
4am โโโ +24
5am โโ +13
6am โ +8
7am โ +7
#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
DigitalPhonetics/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Language:Python
Total stars: 556
Stars trend:
#python
#deeplearning, #pytorch, #speech, #speechprocessing, #speechsynthesis, #texttospeech, #toolkit, #tts
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Language:Python
Total stars: 556
Stars trend:
19 Jun 2024
8pm โ +3
9pm โโโ +20
10pm โโโ +22
11pm โ +6
20 Jun 2024
12am โโ +13
1am โโ +13
#python
#deeplearning, #pytorch, #speech, #speechprocessing, #speechsynthesis, #texttospeech, #toolkit, #tts
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Python
Total stars: 738
Stars trend:
#python
#prosody, #speech, #speechsynthesis, #texttospeech, #voicecloneai, #voicecloning
MARS5 speech model (TTS) from CAMB.AI
Language:Python
Total stars: 738
Stars trend:
24 Jun 2024
7pm โ +1
8pm โโโ +18
9pm โโโโโ +39
10pm โโโโโ +36
#python
#prosody, #speech, #speechsynthesis, #texttospeech, #voicecloneai, #voicecloning
yeyupiaoling/MASR
Pytorchๅฎ็ฐ็ๆตๅผไธ้ๆตๅผ็่ชๅจ่ฏญ้ณ่ฏๅซๆกๆถ๏ผๅๆถๅ ผๅฎนๅจ็บฟๅ็ฆป็บฟ่ฏๅซ๏ผ็ฎๅๆฏๆConformerใSqueezeformerใDeepSpeech2ๆจกๅ๏ผๆฏๆๅค็งๆฐๆฎๅขๅผบๆนๆณใ
Language:Python
Total stars: 580
Stars trend:
#python
#asr, #conformer, #deeplearning, #deepspeech, #pytorch, #speech, #speechrecognition, #speechtotext, #squeezeformer
Pytorchๅฎ็ฐ็ๆตๅผไธ้ๆตๅผ็่ชๅจ่ฏญ้ณ่ฏๅซๆกๆถ๏ผๅๆถๅ ผๅฎนๅจ็บฟๅ็ฆป็บฟ่ฏๅซ๏ผ็ฎๅๆฏๆConformerใSqueezeformerใDeepSpeech2ๆจกๅ๏ผๆฏๆๅค็งๆฐๆฎๅขๅผบๆนๆณใ
Language:Python
Total stars: 580
Stars trend:
3 Aug 2024
2pm โโโโโโโโโโโโ +90
3pm +0
4pm +0
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
4 Aug 2024
12am +0
1am โ +1
#python
#asr, #conformer, #deeplearning, #deepspeech, #pytorch, #speech, #speechrecognition, #speechtotext, #squeezeformer
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python
Total stars: 2735
Stars trend:
#python
#ai, #assistant, #languagemodel, #machinelearning, #python, #speech, #speechsynthesis, #speechtotext, #speechtranslation
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python
Total stars: 2735
Stars trend:
3 Sep 2024
1am โ +3
2am +0
3am +0
4am โ +1
5am +0
6am โ +4
7am โโ +9
8am โโโ +18
9am โโโ +19
10am โโ +9
11am โ +6
12pm โโ +11
#python
#ai, #assistant, #languagemodel, #machinelearning, #python, #speech, #speechsynthesis, #speechtotext, #speechtranslation
๐1
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 10840
Stars trend:
#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 10840
Stars trend:
3 Sep 2024
9am โ +1
10am +0
11am +0
12pm โ +1
1pm โ +1
2pm โ +4
3pm โ +5
4pm โ +4
5pm โโ +10
6pm โโโ +17
7pm โโโ +18
8pm โโโโ +25
#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
Orenoid/BabelDuck
ๆด้ๅๆฐๆ็ AI ๅฃ่ฏญๅฏน่ฏ็ปไน ๅบ็จ / Beginner-friendly AI conversation practice application
Language:TypeScript
Total stars: 167
Stars trend:
#typescript
#ai, #chatbot, #chatgpt, #conversation, #languagelearning, #languagepractice, #openai, #speech
ๆด้ๅๆฐๆ็ AI ๅฃ่ฏญๅฏน่ฏ็ปไน ๅบ็จ / Beginner-friendly AI conversation practice application
Language:TypeScript
Total stars: 167
Stars trend:
13 Dec 2024
12am โ +8
1am โโโโโ +36
2am โโโโ +32
3am โโโโ +25
4am โ +6
#typescript
#ai, #chatbot, #chatgpt, #conversation, #languagelearning, #languagepractice, #openai, #speech
fixie-ai/ultravox
A fast multimodal LLM for real-time voice
Language:Python
Total stars: 2455
Stars trend:
#python
#ai, #llm, #slm, #speech
A fast multimodal LLM for real-time voice
Language:Python
Total stars: 2455
Stars trend:
13 Jan 2025
5am โโ +9
6am โโ +16
7am โโ +14
8am โโโ +17
9am โโ +16
10am โ +7
11am โโ +12
12pm โโ +15
1pm โโ +15
2pm โโ +14
3pm โโ +14
4pm โโ +11
#python
#ai, #llm, #slm, #speech
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 15449
Stars trend:
#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 15449
Stars trend:
6 May 2025
4am โ +6
5am โโ +9
6am โ +6
7am โ +4
8am โโ +9
9am โ +6
10am โ +5
11am โโ +9
12pm โ +3
1pm โ +3
2pm โโ +10
3pm โ +7
#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper