speech-translation

Here are 68 public repositories matching this topic...

macairececile / speech-to-pictograms

Code from the paper "Towards Speech-to-Pictograms Translation" (Interspeech 2024)

machine-translation speech-recognition pictograms speech-translation interspeech2024 augmentative-and-alternative-communication

Updated Jan 29, 2025
Python

hagarz / Speech-to-text-translator

Star

Speech to text and translation client-server using Google cloud

python socket pyaudio translation gcp python3 google-api speech-to-text google-cloud-platform tcp-ip stream-audio google-cloud-translation-api speech-translation

Updated Dec 8, 2022
Python

TuAnh23 / MultiModalST

Star

Limit the use of end-to-end data for Speech Translation (by leveraging Automatic Speech Recognition and Machine Translation data instead) using zero-shot multilingual text translation techniques.

multi-modal zero-shot few-shot speech-translation

Updated May 16, 2022
Python

MohammadarefAhmadpoor / Speech-translation

Star

Speech recognition, language detection, translation, and speech synthesis

speech-synthesis speech-recognition speech-to-text speech-translation

Updated Jul 26, 2024
Python

Think-A-Move / SPEAR-SDK-Java-Linux

Star

SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Java for Linux

Updated Nov 26, 2021
Java

othneildrew / open-whisperer

Star

AI Video Translator and Subtitler

self-hosted speech-recognition video-captioning speech-translation audio-transcription whisper-api ai-tools video-translator multilingual-subtitles ai-transcriber ffmpeg-subtitles auto-subtitles

Updated May 31, 2025
TypeScript

Sharan-Kumar-R / Talk2Translate

Star

The application uses SpeechRecognition, GoogleTranslator, and gTTS to convert spoken English or Tamil into the opposite language, display the translated text, and play the audio output.

tts speech-recognition stt gtts bilingual googletrans speech-translation voice-translator deep-translator real-time-translation

Updated Nov 4, 2025
Python

Speech-To-Text is a C# desktop app that uses Azure Cognitive Services to convert and translate speech. You can copy or show the text on the screen, and choose the language of the speech or the translation.

c-sharp dotnet desktop-application speech-to-text azure-cognitive-services speech-translation

Updated Aug 8, 2023
C#

ksquarekumar / whisper-stream

Star

Whisper Transcription Service

deep-learning inference transformer openai automatic-speech-recognition flax speech-to-text whisper jax speech-translation speech-transcription

Updated Sep 14, 2023
Jupyter Notebook

mahshid1378 / NeMo

Star

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation speaker-diariazation generative-ai large-langage-models

Updated Mar 28, 2025
Python

Think-A-Move / SPEAR-SDK-Java-Windows

Star

SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Java for Windows

Updated Nov 26, 2021
Java

OgeNI / BVC_Challenging_Voice_Set

Star

A database of challenging voice utterances collected by the Biometrics Vision and Computing (BVC) group.

voice voice-recognition speech-recognition speech-to-text voice-detection voice-biometrics speech-translation speech-classification voice-dataset voice-datasets voice-data age-from-voice gender-from-voice voice-age voice-gender

Updated Mar 27, 2025

mt-upc / SegAugment

Star

SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

data-augmentation audio-segmentation speech-translation

Updated Jun 26, 2024
Python

OgeNI / BVC_Afro_Voice_data

Star

A database of Afro Voice utterances, BVC-Afro-Voice data, collected by the Biometrics Vision and Computing (BVC) group.

voice voice-recognition speech-recognition speech-to-text speech-translation voice-dataset voice-data bvc-voice afro-voice

Updated Mar 27, 2025

SiqiLii / Retrieve-and-Demonstration-ST

Star

Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach

speech-translation rare-word retrieval-augmented-translation

Updated Nov 13, 2024
Python

tran-khoa / joint-training-cascaded-st

Sponsor

Star

Code for the paper "Does Joint Training Really Help Cascaded Speech Translation?" (EMNLP 2022)

nlp fairseq speech-translation emnlp2022

Updated Oct 26, 2022
Python

andylee830914 / live_translation

Star

Simultaneous Speech-to-Text and Speech Translation using Azure AI.

translation azure speech-to-text transcription speech-translation

Updated Mar 2, 2025
Python

prashver / nlp-driven-video-summarizer-and-insight-tool

Star

An NLP-powered tool for transcribing, summarizing, and indexing podcast content, with video-to-audio conversion and multilingual support.

natural-language-processing ntlk spacy named-entity-recognition flask-application text-summarization topic-modeling speech-to-text keyword-extraction speech-translation huggingface-transformers

Updated Aug 27, 2024
Python

mt-upc / iwslt-2022

Star

Systems submitted to IWSLT 2022 by the MT-UPC group.

translation adapters pretrained-models fine-tuning speech-translation speech-to-speech

Updated May 18, 2022
Python

ymoslem / Model-Compression

Star

Code for the papers: "Efficient Speech Translation through Model Compression and Knowledge Distillation" and "Iterative Layer Pruning for Efficient Translation Inference"

machine-translation model-compression speech-translation layer-pruning

Updated Sep 24, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-translation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-translation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-translation

Here are 68 public repositories matching this topic...

macairececile / speech-to-pictograms

hagarz / Speech-to-text-translator

TuAnh23 / MultiModalST

MohammadarefAhmadpoor / Speech-translation

Think-A-Move / SPEAR-SDK-Java-Linux

othneildrew / open-whisperer

Sharan-Kumar-R / Talk2Translate

yousef0sa / Speech-To-Text

ksquarekumar / whisper-stream

mahshid1378 / NeMo

Think-A-Move / SPEAR-SDK-Java-Windows

OgeNI / BVC_Challenging_Voice_Set

mt-upc / SegAugment

OgeNI / BVC_Afro_Voice_data

SiqiLii / Retrieve-and-Demonstration-ST

tran-khoa / joint-training-cascaded-st

andylee830914 / live_translation

prashver / nlp-driven-video-summarizer-and-insight-tool

mt-upc / iwslt-2022

ymoslem / Model-Compression

Improve this page

Add this topic to your repo