Code Stars

toverainc/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Language: C
Total stars: 255
Stars trend:
15 May 2023
 8am ▏ +1
 9am  +0
10am  +0
11am  +0
12pm  +0
 1pm  +0
 2pm █████▋ +45
 3pm ███████████▋ +93
 4pm █████ +40

#c
#alexa, #deeplearning, #echo, #espadf, #espidf, #esp32, #googlehome, #homeassistant, #homeautomation, #privacy, #speechrecognition, #speechtotext, #whisper

guillaumekln/faster-whisper
Faster Whisper transcription with CTranslate2
Language: Python
Total stars: 3284
Stars trend:
19 Jul 2023
 1am ▍ +3
 2am ▍ +3
 3am ▎ +2
 4am ██▌ +20
 5am ██████▊ +54
 6am █████▏ +41
 7am ██▏ +17
 8am ██▉ +23
 9am █▍ +11
10am █ +8
11am ▉ +7
12pm █▍ +11

#python
#deeplearning, #inference, #openai, #quantization, #speechrecognition, #speechtotext, #transformer, #whisper

huggingface/distil-whisper

Total stars: 170
Stars trend:
31 Oct 2023
 5pm ▋ +5
 6pm ████▋ +37
 7pm ███▍ +27
 8pm ███▍ +27
 9pm ██▍ +19
10pm ███ +24
11pm ██▌ +20

#audio, #speechrecognition, #whisper

jianchang512/stt
Voice Recognition to Text Tool / An offline, locally run speech-to-text service that outputs JSON, timestamped SRT subtitles, and plain text
Language: Python
Total stars: 233
Stars trend:

5 Jan 2024
12am ▊ +6
 1am ████▌ +36
 2am ███▌ +28
 3am ██▍ +19
 4am █ +8
 5am █▋ +13
 6am ██▏ +17
 7am █▌ +12
 8am ▋ +5
 9am █ +8

#python
#speech, #speechrecognition, #speechtotext, #stt

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python
Total stars: 7392
Stars trend:

28 Feb 2024
 2pm ▏ +1
 3pm ██ +16
 4pm █▌ +12
 5pm █▋ +13
 6pm █▌ +12
 7pm ▉ +7

#python
#asr, #audio, #audioprocessing, #deeplearning, #huggingface, #languagemodel, #pytorch, #speakerdiarization, #speakerrecognition, #speakerverification, #speechenhancement, #speechprocessing, #speechrecognition, #speechseparation, #speechtotext, #speechtoolkit, #spokenlanguageunderstanding, #transformers, #voicerecognition

alibaba-damo-academy/FunClip
An intelligent video clipping tool built on FunASR's high-accuracy open-source speech recognition model / A video clipping tool based on the FunASR open-source model and Gradio.
Language: Shell
Total stars: 411
Stars trend:

21 Apr 2024
 6am ▊ +6
 7am ▊ +6
 8am █ +8
 9am ▋ +5
10am ▊ +6
11am ▌ +4
12pm ▌ +4
 1pm ▊ +6
 2pm █▍ +11
 3pm █▌ +12
 4pm ▌ +4
 5pm ▍ +3

#shell
#speechrecognition, #subtitlesgenerator, #videoclip, #videosubtitles

transcriptionstream/transcriptionstream
Turnkey self-hosted offline transcription and diarization service with LLM summary
Language: Python
Total stars: 200
Stars trend:

26 May 2024
 7pm ▊ +6
 8pm █████▍ +43
 9pm ███▉ +31
10pm ██▊ +22

#python
#automation, #diarization, #llm, #mistral7b, #ollama, #speakerdiarization, #speechrecognition, #transcription, #whisper, #whisperx

ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language: Python
Total stars: 389
Stars trend:

17 Jun 2024
 9pm ▏ +1
10pm ▏ +1
11pm ▎ +2
18 Jun 2024
12am  +0
 1am ▋ +5
 2am ▍ +3
 3am █▍ +11
 4am ███ +24
 5am █▋ +13
 6am █ +8
 7am ▉ +7

#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice

mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language: Python
Total stars: 137
Stars trend:

21 Jun 2024
12am ▍ +3
 1am ██ +16
 2am █▊ +14
 3am █ +8
 4am █▊ +14
 5am ██▎ +18
 6am ██▎ +18
 7am ██▍ +19

#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper

yeyupiaoling/MASR
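A PyTorch implementation of a streaming and non-streaming automatic speech recognition framework, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.
Language: Python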
Total stars: 580
Stars trend:

3 Aug 2024
 2pm ███████████▎ +90
 3pm  +0
 4pm  +0
 5pm  +0
 6pm  +0
 7pm  +0
 8pm  +0
 9pm  +0
10pm  +0
11pm  +0
4 Aug 2024
12am  +0
 1am ▏ +1

#python
#asr, #conformer, #deeplearning, #deepspeech, #pytorch, #speech, #speechrecognition, #speechtotext, #squeezeformer

m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python
Total stars: 10840
Stars trend:

3 Sep 2024
 9am ▏ +1
10am  +0
11am  +0
12pm ▏ +1
 1pm ▏ +1
 2pm ▌ +4
 3pm ▋ +5
 4pm ▌ +4
 5pm █▎ +10
 6pm ██▏ +17
 7pm ██▎ +18
 8pm ███▏ +25

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper

kmario23/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Language: HTML
Total stars: 12193
Stars trend:

8 Oct 2024
 3pm ▏ +1
 4pm  +0
 5pm  +0
 6pm  +0
 7pm  +0
 8pm  +0
 9pm  +0
10pm  +0
11pm  +0
9 Oct 2024
12am  +0
 1am  +0
 2am ██████████████████▊ +162

#html
#artificialintelligencealgorithms, #artificialneuralnetworks, #bayesianstatistics, #computervision, #deeplearning, #deepneuralnetworks, #deepreinforcementlearning, #explainableai, #geometricdeeplearning, #graphneuralnetworks, #machinelearning, #medicalimaging, #naturallanguageprocessing, #optimization, #patternrecognition, #probabilisticgraphicalmodels, #probability, #reinforcementlearning, #speechrecognition, #visualrecognition

abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language: Python
Total stars: 385
Stars trend:

9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
 1am █▏ +9
 2am ██▏ +17
 3am █▎ +10
 4am ▉ +7
 5am ▊ +6
 6am ▍ +3
 7am ▌ +4
 8am ▌ +4
 9am █ +8

#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #whisper, #ytdlp

abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language: Python
Total stars: 2643
Stars trend:

21 Jan 2025
 6am ██▍ +19
 7am ▎ +2
 8am ▌ +4
 9am ▍ +3
10am  +0
11am ▋ +5
12pm ▌ +4
 1pm ▊ +6
 2pm █▋ +13
 3pm ▉ +7
 4pm ▉ +7
 5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp

amanvirparhar/chaplin
A real-time silent speech recognition tool.
Language: Python
Total stars: 84
Stars trend:

3 Feb 2025
12am ▏ +1
 1am █▍ +11
 2am ▊ +6
 3am ▌ +4
 4am ▊ +6
 5am █▏ +9
 6am ▌ +4
 7am ███▏ +25
 8am █▎ +10

#python
#autoavsr, #avsr, #llm, #ollama, #speechrecognition, #speechtotext, #vsr

modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language: Python
Total stars: 9523
Stars trend:

8 Apr 2025
 2pm ▉ +7
 3pm █▏ +9
 4pm ▋ +5
 5pm ▋ +5
 6pm ▌ +4
 7pm ▊ +6
 8pm █ +8
 9pm ▎ +2
10pm ▍ +3
11pm ▊ +6
9 Apr 2025
12am ▊ +6
 1am ██▏ +17

#python
#audiovisualspeechrecognition, #conformer, #dfsmn, #paraformer, #pretrainedmodel, #punctuation, #pytorch, #rnnt, #speakerdiarization, #speechrecognition, #speechgpt, #speechllm, #vad, #voiceactivitydetection, #whisper

m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python
Total stars: 15449
Stars trend:

6 May 2025
 4am ▊ +6
 5am █▏ +9
 6am ▊ +6
 7am ▌ +4
 8am █▏ +9
 9am ▊ +6
10am ▋ +5
11am █▏ +9
12pm ▍ +3
 1pm ▍ +3
 2pm █▎ +10
 3pm ▉ +7

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper

Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language: Python
Total stars: 1009
Stars trend:

8 May 2025
 5am ▍ +3
 6am ▍ +3
 7am ▏ +1
 8am ▍ +3
 9am  +0
10am ▋ +5
11am █▍ +11
12pm █▍ +11
 1pm ▍ +3
 2pm █▊ +14
 3pm █▊ +14
 4pm █ +8

#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers

alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language: Jupyter Notebook
Total stars: 10057
Stars trend:

7 Jun 2025
 7pm ▍ +3
 8pm ▋ +5
 9pm ▎ +2
10pm ▊ +6
11pm ▉ +7
8 Jun 2025
12am ▉ +7
 1am ▉ +7
 2am ▉ +7
 3am █ +8
 4am █▎ +10
 5am ▋ +5
 6am █▏ +9

#jupyternotebook
#android, #asr, #deeplearning, #deepneuralnetworks, #deepspeech, #googlespeechtotext, #ios, #kaldi, #offline, #privacy, #python, #raspberrypi, #speakeridentification, #speakerverification, #speechrecognition, #speechtotext, #speechtotextandroid, #stt, #voicerecognition, #vosk

abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language: Python
Total stars: 3929
Stars trend:

27 Jun 2025
 8am ▍ +3
 9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
 1pm █▉ +15
 2pm █▊ +14
 3pm ▉ +7
 4pm █ +8
 5pm ▌ +4

#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp
