speech-recognition

Star

Here are 4,974 public repositories matching this topic...

huggingface / transformers

Star

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Updated Dec 27, 2024
Python

ggerganov / whisper.cpp

Sponsor

Star

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Dec 24, 2024
C++

mozilla / DeepSpeech

Star

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

machine-learning embedded deep-learning offline tensorflow speech-recognition neural-networks speech-to-text deepspeech on-device

Updated Sep 3, 2024
C++

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Nov 20, 2024
TypeScript

kaldi-asr / kaldi

Star

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

Updated Nov 29, 2024
Shell

NVIDIA / DeepLearningExamples

Star

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

nlp translation computer-vision deep-learning mxnet tensorflow pytorch speech-synthesis speech-recognition forecasting drug-discovery recommender-systems paddlepaddle tensorflow2 large-language-models

Updated Aug 12, 2024
Jupyter Notebook

SYSTRAN / faster-whisper

Star

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated Dec 23, 2024
Python

m-bain / whisperX

Star

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Dec 18, 2024
Python

kmario23 / deep-learning-drizzle

Star

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

Updated Oct 19, 2024
HTML

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Dec 27, 2024
Python

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Dec 20, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Dec 23, 2024
Python

Uberi / speech_recognition

Star

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

Updated Dec 22, 2024
Python

alphacep / vosk-api

Star

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Updated Nov 13, 2024
Jupyter Notebook

nl8590687 / ASRT_SpeechRecognition

Star

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

python tensorflow keras cnn python3 speech-recognition speech-to-text ctc chinese-speech-recognition asrt

Updated Sep 26, 2024
Python

openvinotoolkit / openvino

Star

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated Dec 27, 2024
C++

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Dec 26, 2024
Python

TalAter / annyang

Star

💬 Speech recognition for your site

voice speech speech-recognition speech-to-text

Updated Aug 7, 2024
JavaScript

flashlight / wav2letter

Star

Facebook AI Research's Automatic Speech Recognition Toolkit

deep-learning cpp end-to-end speech-recognition wav2letter

Updated Nov 23, 2024
C++

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Oct 18, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-recognition

Here are 4,974 public repositories matching this topic...

huggingface / transformers

ggerganov / whisper.cpp

mozilla / DeepSpeech

leon-ai / leon

kaldi-asr / kaldi

NVIDIA / DeepLearningExamples

SYSTRAN / faster-whisper

m-bain / whisperX

kmario23 / deep-learning-drizzle

PaddlePaddle / PaddleSpeech

speechbrain / speechbrain

espnet / espnet

Uberi / speech_recognition

alphacep / vosk-api

nl8590687 / ASRT_SpeechRecognition

openvinotoolkit / openvino

modelscope / FunASR

TalAter / annyang

flashlight / wav2letter

snakers4 / silero-models

Improve this page

Add this topic to your repo