Alex-Songs

18 followers · 157 following

ASR-Rescoring Public
Forked from ishine/ASR-Rescoring

Python Updated Apr 26, 2022
TensorflowASR Public
Forked from Z-yq/TensorflowASR

Integrates end-to-end speech recognition models for Tensorflow 2, with an RTF (real-time factor) of around 0.1 / Mandarin State-of-the-art Automatic Speech Recognition in Tensorflow 2

Python Apache License 2.0 Updated Apr 24, 2022
DPSL-ASR Public
Forked from YUCHEN005/DPSL-ASR

Dual-Path Style Learning for End-to-End Noise-Robust Automatic Speech Recognition (DPSL-ASR).

Python 1 Apache License 2.0 Updated Apr 17, 2022
transducer-loss-benchmarking Public
Forked from csukuangfj/transducer-loss-benchmarking

Python Other Updated Mar 25, 2022
SpeechAlgorithms Public
Forked from Ryuk17/SpeechAlgorithms

Speech Algorithms Collections

C Apache License 2.0 Updated Mar 21, 2022
StreamingSpeakerDiarization Public
Forked from juanmc2005/diart

Official open source implementation of the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Python MIT License Updated Dec 21, 2021
auditok Public
Forked from amsehili/auditok

An audio/acoustic activity detection and audio segmentation tool

Python MIT License Updated Nov 3, 2021
git-tips Public
Forked from 521xueweihan/git-tips

Git tips and tricks

Updated Nov 2, 2021
speechmetrics Public
Forked from aliutkus/speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python MIT License Updated Oct 7, 2021
EfficientConformer Public
Forked from burchim/EfficientConformer

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Python Apache License 2.0 Updated Sep 17, 2021
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Python 1 Apache License 2.0 Updated Sep 7, 2021
chinese_text_normalization Public
Forked from speechio/chinese_text_normalization

Chinese text normalization for speech processing

Python MIT License Updated Sep 6, 2021
UHV-OTS-Speech Public
Forked from Appen/UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Forth Apache License 2.0 Updated Sep 3, 2021
WavAugment Public
Forked from facebookresearch/WavAugment

A library for speech data augmentation in time-domain

Python MIT License Updated Aug 30, 2021
espnet Public
Forked from espnet/espnet

End-to-End Speech Processing Toolkit

Python Apache License 2.0 Updated Mar 12, 2021
TS-VAD Public
Forked from dodohow1011/TS-VAD

Python Updated Jan 15, 2021
from_video_get_ASR_traindata Public
Forked from lezasantaizi/from_video_get_ASR_traindata

This project extracts speech recognition training data from videos, for use in training automatic subtitle generation

Python Updated Aug 5, 2018
