speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python
Total stars: 7392
Stars trend:
#python
#asr, #audio, #audioprocessing, #deeplearning, #huggingface, #languagemodel, #pytorch, #speakerdiarization, #speakerrecognition, #speakerverification, #speechenhancement, #speechprocessing, #speechrecognition, #speechseparation, #speechtotext, #speechtoolkit, #speechrecognition, #spokenlanguageunderstanding, #transformers, #voicerecognition
A PyTorch-based Speech Toolkit
Language:Python
Total stars: 7392
Stars trend:
28 Feb 2024
2pm ▏ +1
3pm ██ +16
4pm █▌ +12
5pm █▋ +13
6pm █▌ +12
7pm ▉ +7
#python
#asr, #audio, #audioprocessing, #deeplearning, #huggingface, #languagemodel, #pytorch, #speakerdiarization, #speakerrecognition, #speakerverification, #speechenhancement, #speechprocessing, #speechrecognition, #speechseparation, #speechtotext, #speechtoolkit, #speechrecognition, #spokenlanguageunderstanding, #transformers, #voicerecognition
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
17 Jun 2024
9pm ▏ +1
10pm ▏ +1
11pm ▎ +2
18 Jun 2024
12am +0
1am ▋ +5
2am ▍ +3
3am █▍ +11
4am ███ +24
5am █▋ +13
6am █ +8
7am ▉ +7
#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
cgzirim/not-shazam
An implementation of Shazam's song matching algorithm.
Language:Go
Total stars: 159
Stars trend:
#go
#audiofingerprinting, #audioprocessing, #go, #golang, #notshazam, #shazam, #song, #songrecognitionalgorithm
An implementation of Shazam's song matching algorithm.
Language:Go
Total stars: 159
Stars trend:
1 Aug 2024
2am ▏ +1
3am ▌ +4
4am ▏ +1
5am ▎ +2
6am ▏ +1
7am ▎ +2
8am ▎ +2
9am +0
10am ▌ +4
11am ▌ +4
12pm ████ +32
1pm ████████▎ +66
#go
#audiofingerprinting, #audioprocessing, #go, #golang, #notshazam, #shazam, #song, #songrecognitionalgorithm
cgzirim/seek-tune
An implementation of Shazam's song matching algorithm.
Language:Go
Total stars: 990
Stars trend:
#go
#audiofingerprinting, #audioprocessing, #go, #golang, #notshazam, #shazam, #song, #songrecognitionalgorithm
An implementation of Shazam's song matching algorithm.
Language:Go
Total stars: 990
Stars trend:
2 Aug 2024
11am ▉ +7
12pm ███▊ +30
1pm ██▎ +18
2pm █▍ +11
3pm █▎ +10
#go
#audiofingerprinting, #audioprocessing, #go, #golang, #notshazam, #shazam, #song, #songrecognitionalgorithm
libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
Language:C
Total stars: 2706
Stars trend:
#c
#audio, #audioanalysis, #audiofeatures, #audioprocessing, #deeplearning, #machinelearning, #mfcc, #mir, #music, #musicanalysis, #musicinformationretrieval, #pitch, #python, #signalprocessing, #spectralanalysis, #spectrogram, #timefrequencyanalysis, #waveletanalysis, #wavelettransform
A library for audio and music analysis, feature extraction.
Language:C
Total stars: 2706
Stars trend:
13 Aug 2024
2pm ██████▏ +49
3pm ████████ +64
4pm ████▉ +39
5pm ████▏ +33
6pm ███▏ +25
7pm ██▊ +22
8pm ██▌ +20
9pm ██ +16
10pm █▎ +10
#c
#audio, #audioanalysis, #audiofeatures, #audioprocessing, #deeplearning, #machinelearning, #mfcc, #mir, #music, #musicanalysis, #musicinformationretrieval, #pitch, #python, #signalprocessing, #spectralanalysis, #spectrogram, #timefrequencyanalysis, #waveletanalysis, #wavelettransform
FunAudioLLM/InspireMusic
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
Language:Python
Total stars: 446
Stars trend:
#python
#audiogeneration, #audioprocessing, #musicgeneration, #pytorch
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
Language:Python
Total stars: 446
Stars trend:
11 Feb 2025
3am ███▌ +28
4am █▎ +10
5am ▋ +5
6am █ +8
7am █ +8
8am ▋ +5
9am █▍ +11
#python
#audiogeneration, #audioprocessing, #musicgeneration, #pytorch
cgzirim/seek-tune
An implementation of Shazam's song recognition algorithm.
Language:Go
Total stars: 2112
Stars trend:
#go
#audiofingerprinting, #audioprocessing, #go, #golang, #notshazam, #shazam, #song, #songrecognitionalgorithm
An implementation of Shazam's song recognition algorithm.
Language:Go
Total stars: 2112
Stars trend:
28 Feb 2025
9pm ▍ +3
10pm +0
11pm ▎ +2
1 Mar 2025
12am ▍ +3
1am ▌ +4
2am ▏ +1
3am ▎ +2
4am █▏ +9
5am █▌ +12
6am ▊ +6
7am ██▏ +17
8am ██ +16
#go
#audiofingerprinting, #audioprocessing, #go, #golang, #notshazam, #shazam, #song, #songrecognitionalgorithm
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language:Python
Total stars: 1009
Stars trend:
#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language:Python
Total stars: 1009
Stars trend:
8 May 2025
5am ▍ +3
6am ▍ +3
7am ▏ +1
8am ▍ +3
9am +0
10am ▋ +5
11am █▍ +11
12pm █▍ +11
1pm ▍ +3
2pm █▊ +14
3pm █▊ +14
4pm █ +8
#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers