speech-translation

Here are 46 public repositories matching this topic...

espnet / espnet

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Nov 12, 2025
Python

NVIDIA-NeMo / NeMo

Star

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated Nov 12, 2025
Python

PalabraAI / palabra-ai-python

Star

Python SDK for Palabra AI's real-time speech-to-speech translation API. Break down language barriers and enable seamless communication across 25+ languages

translation languages seamless speech-translation s2st

Updated Nov 11, 2025
Python

Sharan-Kumar-R / Talk2Translate

Star

The application uses SpeechRecognition, GoogleTranslator, and gTTS to convert spoken English or Tamil into the opposite language, display the translated text, and play the audio output.

tts speech-recognition stt gtts bilingual googletrans speech-translation voice-translator deep-translator real-time-translation

Updated Nov 4, 2025
Python

hlt-mt / FBK-fairseq

Star

Repository containing the open source code of works published at the FBK MT unit.

deep-learning pytorch speech-to-text subtitling gender-bias speech-translation simultaneous-translation

Updated Nov 1, 2025
Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Oct 20, 2025
Python

chentuochao / Spatial-Speech-Translation

Star

The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"

spatial-audio speech-separation speech-translation

Updated Aug 15, 2025
Python

ictnlp / StreamUni

Star

StreamUni is a framework that efficiently enables unified Large Speech-Language Models to accomplish streaming speech translation in a cohesive manner.

speech-recognition speech-to-text speech-processing multimodal speech-translation simultaneous-translation large-language-models llms simultaneous-machine-translation multimodal-large-language-models streaming-generation phi4-multimodal speech-language-models speeech-llms

Updated Jul 14, 2025
Python

ictnlp / StreamSpeech

Star

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 29, 2025
Python

mct10 / IWSLT2025_LowRes_ST

Star

Code for GMU's submission to IWSLT 2025 Low-Resource Speech Translation Shared Task

machine-translation speech-translation

Updated May 29, 2025
Python

steventan0110 / STAR

Star

Official Repository for our IWSLT 2025 paper "Streaming Sequence Transduction through Dynamic Compression"

speech-translation simultaneous-translation

Updated May 22, 2025
Python

The-Data-Dilemma / ParquetToHuggingFace

Star

ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.

data-science pandas python3 dataset speech-recognition data-analysis parquet automatic-speech-recognition speech-to-text parquet-generator healthcare-application audio-processing speech-data speech-translation huggingface audio-dataset huggingface-datasets

Updated May 16, 2025
Python

huggingface / speech-to-speech

Star

Speech To Speech: an effort for an open-sourced and modular GPT4-o

python machine-learning ai speech speech-synthesis assistant speech-to-text language-model speech-translation

Updated Apr 15, 2025
Python

mahshid1378 / NeMo

Star

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation speaker-diariazation generative-ai large-langage-models

Updated Mar 28, 2025
Python

andylee830914 / live_translation

Star

Simultaneous Speech-to-Text and Speech Translation using Azure AI.

translation azure speech-to-text transcription speech-translation

Updated Mar 2, 2025
Python

KevKibe / African-Whisper

Sponsor

Star

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Feb 27, 2025
Python

macairececile / speech-to-pictograms

Star

Code from the paper "Towards Speech-to-Pictograms Translation" (Interspeech 2024)

machine-translation speech-recognition pictograms speech-translation interspeech2024 augmentative-and-alternative-communication

Updated Jan 29, 2025
Python

liamdugan / speech-to-speech

Star

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

speech speech-processing speech-translation speech-to-speech simultaneous-translation

Updated Jan 14, 2025
Python

MooreThreads / MooER

Star

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.

speech-recognition speech-to-text speech-translation speech-to-speech large-language-models chatgpt gpt-4o speech-interaction

Updated Jan 8, 2025
Python

mt-upc / ZeroSwot

Star

Pushing the Limits of Zero-shot End-to-End Speech Translation

translation speech-translation

Updated Dec 12, 2024
Python

Improve this page

Add a description, image, and links to the speech-translation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-translation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-translation

Here are 46 public repositories matching this topic...

espnet / espnet

NVIDIA-NeMo / NeMo

PalabraAI / palabra-ai-python

Sharan-Kumar-R / Talk2Translate

hlt-mt / FBK-fairseq

PaddlePaddle / PaddleSpeech

chentuochao / Spatial-Speech-Translation

ictnlp / StreamUni

ictnlp / StreamSpeech

mct10 / IWSLT2025_LowRes_ST

steventan0110 / STAR

The-Data-Dilemma / ParquetToHuggingFace

huggingface / speech-to-speech

mahshid1378 / NeMo

andylee830914 / live_translation

KevKibe / African-Whisper

macairececile / speech-to-pictograms

liamdugan / speech-to-speech

MooreThreads / MooER

mt-upc / ZeroSwot

Improve this page

Add this topic to your repo