speech-translation

Here are 46 public repositories matching this topic...

ReneeYe / XSTNet

This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)

machine-translation speech-recognition neural-machine-translation spoken-language-processing tensorflow2 speech-translation interspeech2021

Updated May 1, 2022
Python

TuAnh23 / MultiModalST

Star

Limit the use of end-to-end data for Speech Translation (by leveraging Automatic Speech Recognition and Machine Translation data instead) using zero-shot multilingual text translation techniques.

multi-modal zero-shot few-shot speech-translation

Updated May 16, 2022
Python

mt-upc / iwslt-2022

Star

Systems submitted to IWSLT 2022 by the MT-UPC group.

translation adapters pretrained-models fine-tuning speech-translation speech-to-speech

Updated May 18, 2022
Python

ReneeYe / ConST

Star

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

translation machine-translation pytorch transformer neural-machine-translation spoken-language-processing speec speech-translation contrastive-learning naacl2022

Updated May 25, 2022
Python

George0828Zhang / simulst

Star

PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.

streaming pytorch speech-recognition speech-to-text speech-translation simultaneous-translation

Updated Oct 3, 2022
Python

tran-khoa / joint-training-cascaded-st

Sponsor

Star

Code for the paper "Does Joint Training Really Help Cascaded Speech Translation?" (EMNLP 2022)

nlp fairseq speech-translation emnlp2022

Updated Oct 26, 2022
Python

ictnlp / ITST

Star

Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"

machine-translation speech-translation simultaneous-translation end-to-end-speech-translation simultaneous-machine-translation

Updated Nov 3, 2022
Python

hagarz / Speech-to-text-translator

Star

Speech to text and translation client-server using Google cloud

python socket pyaudio translation gcp python3 google-api speech-to-text google-cloud-platform tcp-ip stream-audio google-cloud-translation-api speech-translation

Updated Dec 8, 2022
Python

mt-upc / SHAS

Star

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

speech speech-to-text audio-segmentation speech-translation wav2vec2

Updated Feb 9, 2023
Python

bzhangGo / st_from_scratch

Star

Revisiting End-to-End Speech-to-Text Translation From Scratch

speech-translation end-to-end-speech-translation speech-to-text-translation speech-translation-from-scratch

Updated Feb 21, 2023
Python

bzhangGo / zero

Star

Zero -- A neural machine translation system

transformer neural-machine-translation average-attention-network aan speech-translation depth-scaled-initialization deep-transformer l0drop adaptive-feature-selection massively-multilingual-translation opus-100 fast-bidirectional-decoder

Updated May 8, 2023
Python

xuchennlp / S2T

Star

The project for speech translation

speech-recognition speech-to-text speech-translation

Updated Sep 28, 2023
Python

ictnlp / STEMM

Star

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

machine-translation speech-to-text speech-translation

Updated Oct 25, 2023
Python

ictnlp / CRESS

Star

Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".

machine-translation speech-to-text speech-translation

Updated Oct 25, 2023
Python

ictnlp / BT4ST

Star

Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".

machine-translation speech-to-text speech-translation

Updated Oct 25, 2023
Python

yaya-sy / speechscorer

Star

unsupervised spoken utterances scoring

speech speech-recognition whisper self-supervised-learning speech-translation hubert

Updated Nov 21, 2023
Python

ictnlp / DiSeg

Star

Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"

segment streaming machine-translation speech segmentation sequence-segmentation speech-translation simultaneous-translation simultaneous-machine-translation streaming-speech-to-text

Updated Dec 6, 2023
Python

Dadangdut33 / Speech-Translate

Star

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

python translate whisper tkinter-python speech-translation speech-transcription

Updated Jan 18, 2024
Python

George0828Zhang / torch_cif

Star

A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.

speech torch pytorch speech-recognition alignment automatic-speech-recognition speech-to-text cif asr monotonic speech-translation continuous-integrate-and-fire

Updated Feb 10, 2024
Python

microsoft / SpeechT5

Star

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

speech-synthesis speech-recognition speech-translation speech-pretraining speecht5 speech2c speechlm speechut speech-text-pretraining vatlm vallex

Updated Apr 24, 2024
Python

Improve this page

Add a description, image, and links to the speech-translation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-translation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-translation

Here are 46 public repositories matching this topic...

ReneeYe / XSTNet

TuAnh23 / MultiModalST

mt-upc / iwslt-2022

ReneeYe / ConST

George0828Zhang / simulst

tran-khoa / joint-training-cascaded-st

ictnlp / ITST

hagarz / Speech-to-text-translator

mt-upc / SHAS

bzhangGo / st_from_scratch

bzhangGo / zero

xuchennlp / S2T

ictnlp / STEMM

ictnlp / CRESS

ictnlp / BT4ST

yaya-sy / speechscorer

ictnlp / DiSeg

Dadangdut33 / Speech-Translate

George0828Zhang / torch_cif

microsoft / SpeechT5

Improve this page

Add this topic to your repo