huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python
Stars trend:
2 Apr 2023
7am ██ 10
8am ███ 17
9am ██ 16
10am ██ 15
11am █ 8
12pm ██ 9
1pm █ 8
2pm ███ 18
3pm ██ 12
4pm ██ 9
5pm ██ 15
6pm ██ 16
#python
#bert, #deeplearning, #flax, #hacktoberfest, #jax, #languagemodel, #languagemodels, #machinelearning, #modelhub, #naturallanguageprocessing, #nlp, #nlplibrary, #pretrainedmodels, #python, #pytorch, #pytorchtransformers, #seq2seq, #speechrecognition, #tensorflow, #transformer
toverainc/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Language: C
Total stars: 255
Stars trend:
15 May 2023
8am █ +1
9am +0
10am +0
11am +0
12pm +0
1pm +0
2pm ██████ +45
3pm ████████████ +93
4pm █████ +40
#c
#alexa, #deeplearning, #echo, #espadf, #espidf, #esp32, #googlehome, #homeassistant, #homeautomation, #privacy, #speechrecognition, #speechtotext, #whisper
guillaumekln/faster-whisper
Faster Whisper transcription with CTranslate2
Language: Python
Total stars: 3284
Stars trend:
19 Jul 2023
1am █ +3
2am █ +3
3am █ +2
4am ███ +20
5am ███████ +54
6am ██████ +41
7am ███ +17
8am ███ +23
9am ██ +11
10am █ +8
11am █ +7
12pm ██ +11
#python
#deeplearning, #inference, #openai, #quantization, #speechrecognition, #speechtotext, #transformer, #whisper
huggingface/distil-whisper
Total stars: 170
Stars trend:
31 Oct 2023
5pm █ +5
6pm █████ +37
7pm ████ +27
8pm ████ +27
9pm ███ +19
10pm ███ +24
11pm ███ +20
#audio, #speechrecognition, #whisper
jianchang512/stt
Voice Recognition to Text Tool / An offline, locally run speech-to-text service that outputs JSON, SRT subtitles with timestamps, and plain text
Language: Python
Total stars: 233
Stars trend:
5 Jan 2024
12am █ +6
1am █████ +36
2am ████ +28
3am ███ +19
4am █ +8
5am ██ +13
6am ███ +17
7am ██ +12
8am █ +5
9am █ +8
#python
#speech, #speechrecognition, #speechtotext, #stt
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python
Total stars: 7392
Stars trend:
28 Feb 2024
2pm █ +1
3pm ██ +16
4pm ██ +12
5pm ██ +13
6pm ██ +12
7pm █ +7
#python
#asr, #audio, #audioprocessing, #deeplearning, #huggingface, #languagemodel, #pytorch, #speakerdiarization, #speakerrecognition, #speakerverification, #speechenhancement, #speechprocessing, #speechrecognition, #speechseparation, #speechtotext, #speechtoolkit, #spokenlanguageunderstanding, #transformers, #voicerecognition
alibaba-damo-academy/FunClip
A smart video clipping tool based on FunASR's high-accuracy open-source speech recognition model and Gradio.
Language: Shell
Total stars: 411
Stars trend:
21 Apr 2024
6am █ +6
7am █ +6
8am █ +8
9am █ +5
10am █ +6
11am █ +4
12pm █ +4
1pm █ +6
2pm ██ +11
3pm ██ +12
4pm █ +4
5pm █ +3
#shell
#speechrecognition, #subtitlesgenerator, #videoclip, #videosubtitles
transcriptionstream/transcriptionstream
Turnkey self-hosted offline transcription and diarization service with LLM summary
Language: Python
Total stars: 200
Stars trend:
26 May 2024
7pm █ +6
8pm ██████ +43
9pm ████ +31
10pm ███ +22
#python
#automation, #diarization, #llm, #mistral7b, #ollama, #speakerdiarization, #speechrecognition, #transcription, #whisper, #whisperx
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language: Python
Total stars: 389
Stars trend:
17 Jun 2024
9pm █ +1
10pm █ +1
11pm █ +2
18 Jun 2024
12am +0
1am █ +5
2am █ +3
3am ██ +11
4am ███ +24
5am ██ +13
6am █ +8
7am █ +7
#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language: Python
Total stars: 137
Stars trend:
21 Jun 2024
12am █ +3
1am ██ +16
2am ██ +14
3am █ +8
4am ██ +14
5am ███ +18
6am ███ +18
7am ███ +19
#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper
yeyupiaoling/MASR
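A PyTorch implementation of a streaming and non-streaming automatic speech recognition framework, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.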
Language: Python
Total stars: 580
Stars trend:
3 Aug 2024
2pm ████████████ +90
3pm +0
4pm +0
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
4 Aug 2024
12am +0
1am █ +1
#python
#asr, #conformer, #deeplearning, #deepspeech, #pytorch, #speech, #speechrecognition, #speechtotext, #squeezeformer
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python
Total stars: 10840
Stars trend:
3 Sep 2024
9am █ +1
10am +0
11am +0
12pm █ +1
1pm █ +1
2pm █ +4
3pm █ +5
4pm █ +4
5pm ██ +10
6pm ███ +17
7pm ███ +18
8pm ████ +25
#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
kmario23/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Language: HTML
Total stars: 12193
Stars trend:
8 Oct 2024
3pm +1
4pm +0
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
9 Oct 2024
12am +0
1am +0
2am ███████████████████ +162
#html
#artificialintelligencealgorithms, #artificialneuralnetworks, #bayesianstatistics, #computervision, #deeplearning, #deepneuralnetworks, #deepreinforcementlearning, #explainableai, #geometricdeeplearning, #graphneuralnetworks, #machinelearning, #medicalimaging, #naturallanguageprocessing, #optimization, #patternrecognition, #probabilisticgraphicalmodels, #probability, #reinforcementlearning, #speechrecognition, #visualrecognition
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language: Python
Total stars: 385
Stars trend:
9 Nov 2024
10pm █ +1
11pm █ +4
10 Nov 2024
12am █ +2
1am ██ +9
2am ███ +17
3am ██ +10
4am █ +7
5am █ +6
6am █ +3
7am █ +4
8am █ +4
9am █ +8
#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #whisper, #ytdlp
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language: Python
Total stars: 2643
Stars trend:
21 Jan 2025
6am ███ +19
7am █ +2
8am █ +4
9am █ +3
10am +0
11am █ +5
12pm █ +4
1pm █ +6
2pm ██ +13
3pm █ +7
4pm █ +7
5pm █ +6
#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp
amanvirparhar/chaplin
A real-time silent speech recognition tool.
Language: Python
Total stars: 84
Stars trend:
3 Feb 2025
12am █ +1
1am ██ +11
2am █ +6
3am █ +4
4am █ +6
5am ██ +9
6am █ +4
7am ████ +25
8am ██ +10
#python
#autoavsr, #avsr, #llm, #ollama, #speechrecognition, #speechtotext, #vsr
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language: Python
Total stars: 9523
Stars trend:
8 Apr 2025
2pm █ +7
3pm ██ +9
4pm █ +5
5pm █ +5
6pm █ +4
7pm █ +6
8pm █ +8
9pm █ +2
10pm █ +3
11pm █ +6
9 Apr 2025
12am █ +6
1am ███ +17
#python
#audiovisualspeechrecognition, #conformer, #dfsmn, #paraformer, #pretrainedmodel, #punctuation, #pytorch, #rnnt, #speakerdiarization, #speechrecognition, #speechgpt, #speechllm, #vad, #voiceactivitydetection, #whisper
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python
Total stars: 15449
Stars trend:
6 May 2025
4am █ +6
5am ██ +9
6am █ +6
7am █ +4
8am ██ +9
9am █ +6
10am █ +5
11am ██ +9
12pm █ +3
1pm █ +3
2pm ██ +10
3pm █ +7
#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language: Python
Total stars: 1009
Stars trend:
8 May 2025
5am █ +3
6am █ +3
7am █ +1
8am █ +3
9am +0
10am █ +5
11am ██ +11
12pm ██ +11
1pm █ +3
2pm ██ +14
3pm ██ +14
4pm █ +8
#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language: Jupyter Notebook
Total stars: 10057
Stars trend:
7 Jun 2025
7pm █ +3
8pm █ +5
9pm █ +2
10pm █ +6
11pm █ +7
8 Jun 2025
12am █ +7
1am █ +7
2am █ +7
3am █ +8
4am ██ +10
5am █ +5
6am ██ +9
#jupyternotebook
#android, #asr, #deeplearning, #deepneuralnetworks, #deepspeech, #googlespeechtotext, #ios, #kaldi, #offline, #privacy, #python, #raspberrypi, #speakeridentification, #speakerverification, #speechrecognition, #speechtotext, #speechtotextandroid, #stt, #voicerecognition, #vosk