TensorflowASR Public
Forked from Z-yq/TensorflowASR
Integrates end-to-end speech recognition models for Tensorflow 2, with a real-time factor (RTF) of about 0.1 / Mandarin State-of-the-art Automatic Speech Recognition in Tensorflow 2
Python Apache License 2.0 Updated Apr 24, 2022
DPSL-ASR Public
Forked from YUCHEN005/DPSL-ASR
Dual-Path Style Learning for End-to-End Noise-Robust Automatic Speech Recognition (DPSL-ASR).
transducer-loss-benchmarking Public
Forked from csukuangfj/transducer-loss-benchmarking
Python Other Updated Mar 25, 2022
SpeechAlgorithms Public
Forked from Ryuk17/SpeechAlgorithms
Speech Algorithms Collections
C Apache License 2.0 Updated Mar 21, 2022
StreamingSpeakerDiarization Public
Forked from juanmc2005/diart
Official open source implementation of the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"
Python MIT License Updated Dec 21, 2021
auditok Public
Forked from amsehili/auditok
An audio/acoustic activity detection and audio segmentation tool
Python MIT License Updated Nov 3, 2021
speechmetrics Public
Forked from aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Python MIT License Updated Oct 7, 2021
EfficientConformer Public
Forked from burchim/EfficientConformer
Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
Python Apache License 2.0 Updated Sep 17, 2021
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
chinese_text_normalization Public
Forked from speechio/chinese_text_normalization
Chinese text normalization for speech processing
Python MIT License Updated Sep 6, 2021
UHV-OTS-Speech Public
Forked from Appen/UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Forth Apache License 2.0 Updated Sep 3, 2021
WavAugment Public
Forked from facebookresearch/WavAugment
A library for speech data augmentation in time-domain
Python MIT License Updated Aug 30, 2021
espnet Public
Forked from espnet/espnet
End-to-End Speech Processing Toolkit
Python Apache License 2.0 Updated Mar 12, 2021
from_video_get_ASR_traindata Public
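Forked from lezasantaizi/from_video_get_ASR_traindata
The goal of this project is to extract speech recognition training data from videos, for use in training automatic subtitle generation
Python Updated Aug 5, 2018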