Code Stars
1.93K subscribers
9.34K photos
9.63K links
Code Stars alerts you to GitHub repos gaining stars rapidly. Stay ahead of the curve and discover trending projects before they go viral! #AI #GitHub #OpenSource #Tech #MachineLearning #Python #Programming #Java #Javascript #React #Docker #Devops
Download Telegram
alibaba-damo-academy/FunClip
一款基于FunASR高准确率开源语音识别模型的智能视频剪辑工具 / A video clipping tool based on FunASR open source model and Gradio.
Language:Shell
Total stars: 411
Stars trend:
21 Apr 2024
6am ▊ +6
7am ▊ +6
8am █ +8
9am ▋ +5
10am ▊ +6
11am ▌ +4
12pm ▌ +4
1pm ▊ +6
2pm █▍ +11
3pm █▌ +12
4pm ▌ +4
5pm ▍ +3

#shell
#speechrecognition, #subtitlesgenerator, #videoclip, #videosubtitles
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language:Python
Total stars: 200
Stars trend:
26 May 2024
7pm ▊ +6
8pm █████▍ +43
9pm ███▉ +31
10pm ██▊ +22

#python
#automation, #diarization, #llm, #mistral7b, #ollama, #speakerdiarization, #speechrecognition, #transcription, #whisper, #whisperx
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
17 Jun 2024
9pm ▏ +1
10pm ▏ +1
11pm ▎ +2
18 Jun 2024
12am +0
1am ▋ +5
2am ▍ +3
3am █▍ +11
4am ███ +24
5am █▋ +13
6am █ +8
7am ▉ +7

#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language:Python
Total stars: 137
Stars trend:
21 Jun 2024
12am ▍ +3
1am ██ +16
2am █▊ +14
3am █ +8
4am █▊ +14
5am ██▎ +18
6am ██▎ +18
7am ██▍ +19

#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper
1
yeyupiaoling/MASR
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
Language:Python
Total stars: 580
Stars trend:
3 Aug 2024
2pm ███████████▎ +90
3pm +0
4pm +0
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
4 Aug 2024
12am +0
1am ▏ +1

#python
#asr, #conformer, #deeplearning, #deepspeech, #pytorch, #speech, #speechrecognition, #speechtotext, #squeezeformer
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 10840
Stars trend:
3 Sep 2024
9am ▏ +1
10am +0
11am +0
12pm ▏ +1
1pm ▏ +1
2pm ▌ +4
3pm ▋ +5
4pm ▌ +4
5pm █▎ +10
6pm ██▏ +17
7pm ██▎ +18
8pm ███▏ +25

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
kmario23/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Language:HTML
Total stars: 12193
Stars trend:
8 Oct 2024
3pm +1
4pm +0
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
9 Oct 2024
12am +0
1am +0
2am ██████████████████▊ +162

#html
#artificialintelligencealgorithms, #artificialneuralnetworks, #bayesianstatistics, #computervision, #deeplearning, #deepneuralnetworks, #deepreinforcementlearning, #explainableai, #geometricdeeplearning, #graphneuralnetworks, #machinelearning, #medicalimaging, #naturallanguageprocessing, #optimization, #patternrecognition, #probabilisticgraphicalmodels, #probability, #reinforcementlearning, #speechrecognition, #visualrecognition
1
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language:Python
Total stars: 385
Stars trend:
9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
1am █▏ +9
2am ██▏ +17
3am █▎ +10
4am ▉ +7
5am ▊ +6
6am ▍ +3
7am ▌ +4
8am ▌ +4
9am █ +8

#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #webui, #whisper, #ytdlp
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:
21 Jan 2025
6am ██▍ +19
7am ▎ +2
8am ▌ +4
9am ▍ +3
10am +0
11am ▋ +5
12pm ▌ +4
1pm ▊ +6
2pm █▋ +13
3pm ▉ +7
4pm ▉ +7
5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp
👍1
amanvirparhar/chaplin
A real-time silent speech recognition tool.
Language:Python
Total stars: 84
Stars trend:
3 Feb 2025
12am ▏ +1
1am █▍ +11
2am ▊ +6
3am ▌ +4
4am ▊ +6
5am █▏ +9
6am ▌ +4
7am ███▏ +25
8am █▎ +10

#python
#autoavsr, #avsr, #llm, #ollama, #speechrecognition, #speechtotext, #vsr
1
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python
Total stars: 9523
Stars trend:
8 Apr 2025
2pm ▉ +7
3pm █▏ +9
4pm ▋ +5
5pm ▋ +5
6pm ▌ +4
7pm ▊ +6
8pm █ +8
9pm ▎ +2
10pm ▍ +3
11pm ▊ +6
9 Apr 2025
12am ▊ +6
1am ██▏ +17

#python
#audiovisualspeechrecognition, #conformer, #dfsmn, #paraformer, #pretrainedmodel, #punctuation, #pytorch, #rnnt, #speakerdiarization, #speechrecognition, #speechgpt, #speechllm, #vad, #voiceactivitydetection, #whisper
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 15449
Stars trend:
6 May 2025
4am ▊ +6
5am █▏ +9
6am ▊ +6
7am ▌ +4
8am █▏ +9
9am ▊ +6
10am ▋ +5
11am █▏ +9
12pm ▍ +3
1pm ▍ +3
2pm █▎ +10
3pm ▉ +7

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language:Python
Total stars: 1009
Stars trend:
8 May 2025
5am ▍ +3
6am ▍ +3
7am ▏ +1
8am ▍ +3
9am +0
10am ▋ +5
11am █▍ +11
12pm █▍ +11
1pm ▍ +3
2pm █▊ +14
3pm █▊ +14
4pm █ +8

#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers
1🔥1
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:Jupyter Notebook
Total stars: 10057
Stars trend:
7 Jun 2025
7pm ▍ +3
8pm ▋ +5
9pm ▎ +2
10pm ▊ +6
11pm ▉ +7
8 Jun 2025
12am ▉ +7
1am ▉ +7
2am ▉ +7
3am █ +8
4am █▎ +10
5am ▋ +5
6am █▏ +9

#jupyternotebook
#android, #asr, #deeplearning, #deepneuralnetworks, #deepspeech, #googlespeechtotext, #ios, #kaldi, #offline, #privacy, #python, #raspberrypi, #speakeridentification, #speakerverification, #speechrecognition, #speechtotext, #speechtotextandroid, #stt, #voicerecognition, #vosk
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:
27 Jun 2025
8am ▍ +3
9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
1pm █▉ +15
2pm █▊ +14
3pm ▉ +7
4pm █ +8
5pm ▌ +4

#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp