Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 319 25 Updated Dec 22, 2025

Tele-AI / TeleSpeech-ASR

Python 849 79 Updated Jun 7, 2024

haliaeetus / iso-639

ISO 639 language codes with names in English, French & German, provided in JSON & CSV or as a NodeJS module.

JavaScript 89 70 Updated Aug 26, 2022

saffsd / langid.py

Stand-alone language identification system

Python 2,462 321 Updated Jan 1, 2020

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,912 161 Updated Feb 25, 2026

Mirantis / cri-dockerd

dockerd as a compliant Container Runtime Interface for Kubernetes

Go 1,354 350 Updated Jun 2, 2026

KeSpeech / KeSpeech

The repo provides information about KeSpeech dataset.

178 11 Updated Oct 13, 2022

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,228 6,679 Updated Sep 30, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 102,951 12,553 Updated Apr 15, 2026

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 22,549 2,308 Updated Jun 3, 2026

fighting41love / funNLP

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 81,324 15,240 Updated May 10, 2024

FunAudioLLM / SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

Python 8,601 784 Updated Jun 9, 2026

Syllo / nvtop

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 10,751 396 Updated May 6, 2026

yeyupiaoling / Whisper-Finetune

C 1,217 219 Updated May 8, 2026

modelscope / FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Python 18,272 1,865 Updated Jun 17, 2026

microsoft / MMdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, …

Python 5,804 958 Updated Aug 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ts0923

Block or report ts0923

Stars

hectorqin / reader

maotoumao / MusicFree

wenet-e2e / wespeaker

ZongxianLee / MMD_Loss.Pytorch

zyzisyz / mfa_conformer

SSARCandy / DeepCORAL

pmorerio / minimal-entropy-correlation-alignment

VisionLearningGroup / CORAL

jackaduma / CycleGAN-VC3

jackaduma / CycleGAN-VC2

shackysureshot / StarGAN-Voice-Conversion-2

ChenNan1996 / PCF-NAT

DataoceanAI / Dolphin

LianjiaTech / BELLE

shuaijiang / Whisper-Finetune