# asr

Here are 1,035 public repositories matching this topic...

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated Jul 3, 2024
Python

PeterH0323 / Streamer-Sales

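Streamer-Sales 销冠 (Sales Champion): a sales-streamer LLM 🛒🎁 that presents products based on their given features, framed to spark the viewer's desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 Also integrates LMDeploy for accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that searches the web for real-time information 🌐, and ASR (speech-to-text) 🎙️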

chat chatbot text-generation tts gpt chat-application asr rag digital-human llm chatgpt internlm-chat-7b internlm2 meta-human

Updated Jul 3, 2024
Python

voicegain / platform

Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)

deep-neural-networks ivr speech-to-text rtc transcription asr mrcp

Updated Jul 3, 2024
HTML

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jul 3, 2024
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jul 3, 2024
Python

bharathraj-v / fastconformer-ctc-telugu

NVIDIA NeMo's stt_en_fastconformer_ctc_large fine-tuned on open-source Telugu data for Automatic Speech Recognition

deep-learning speech-recognition asr

Updated Jul 2, 2024
Jupyter Notebook

tensorflow / lingvo

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Jul 2, 2024
Python

deepgram / deepgram-python-sdk

Official Python SDK for Deepgram's automated speech recognition APIs.

python speech-recognition hacktoberfest asr deepgram automated-speech-recognition

Updated Jul 2, 2024
Python

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Jul 2, 2024
Python

deepgram / deepgram-js-sdk

Official JavaScript SDK for Deepgram's automated speech recognition APIs.

javascript typescript ai speech-recognition speech-to-text hacktoberfest asr deepgram automated-speech-recognition

Updated Jul 2, 2024
TypeScript

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready whisper asr conformer e2e-models

Updated Jul 3, 2024
Python

winstxnhdw / CapGen

A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.

docker caddy automatic-speech-recognition whisper asr fastapi uvicorn-gunicorn huggingface huggingface-spaces ctranslate2

Updated Jul 2, 2024
Python

rwth-i6 / rasr

The RWTH ASR Toolkit.

asr

Updated Jul 2, 2024
C++

flozi00 / atra

An open-source NLP-as-a-service project focused on making state-of-the-art systems easy to use. Training and inference run through simple Docker commands.

chatbot speech transformers inference speech-recognition asr llm stable-diffusion

Updated Jul 2, 2024
Jupyter Notebook

KevKibe / African-Whisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Jul 2, 2024
Python

double22a / speech_dataset

The dataset of Speech Recognition

audio text-to-speech deep-neural-networks deep-learning speech tts speech-synthesis dataset wav speech-recognition automatic-speech-recognition speech-to-text voice-conversion asr speech-separation speech-enhancement speech-segmentation speech-translation speech-diarization

Updated Jul 2, 2024

innerNULL / mia

My Implementations' Archive

audio nlp training crawler machine-learning youtube deep-learning corpus youtube-dl dataset youtube-downloader data-collection asr paper-implementations youtube-crawler

Updated Jul 2, 2024
Python

mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated Jul 1, 2024
C++

ieasybooks / tafrigh

Transcribe audio to text and generate SRT and VTT files using Whisper models and wit.ai technology.

python youtube facebook twitter soundcloud subtitles srt vtt automatic-speech-recognition whisper asr stable-whisper faster-whisper ctranslate2

Updated Jul 1, 2024
Python

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated Jul 1, 2024
Jupyter Notebook
