- Shanghai,China
- blog.csdn.net/zhulinniao
-
index-tts Public
Forked from index-tts/index-ttsAn Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Python Apache License 2.0 UpdatedSep 8, 2025 -
ClearerVoice-Studio Public
Forked from modelscope/ClearerVoice-StudioAn AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Python Apache License 2.0 UpdatedApr 15, 2025 -
Fast-Spark-TTS Public
Forked from HuiResearch/FlashTTS基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。
Python UpdatedApr 7, 2025 -
-
async_cosyvoice Public
Forked from qi-hua/async_cosyvoice使用vllm加速cosyvoice2的推理
-
LLaMA-Factory20250318 Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python Apache License 2.0 UpdatedMar 17, 2025 -
ChatTTS Public
Forked from 2noise/ChatTTSA generative speech model for daily dialogue.
Python GNU Affero General Public License v3.0 UpdatedMar 14, 2025 -
-
My-FunASR Public
Forked from peilongchencc/My-FunASR基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。
Python Apache License 2.0 UpdatedOct 12, 2024 -
-
pyResults Public
Forked from nuaalixu/pyResultsA tool for calculating WER (Word Error Rate) in python.
Python MIT License UpdatedSep 18, 2024 -
silero-vad5 Public
Forked from snakers4/silero-vadSilero VAD: pre-trained enterprise-grade Voice Activity Detector
Python MIT License UpdatedAug 22, 2024 -
-
west Public
Forked from wenet-e2e/wesrWe Speech Transcript based on LLM, in 300 lines of code.
Python Apache License 2.0 UpdatedAug 16, 2024 -
whisper-medusa Public
Forked from aiola-lab/whisper-medusaWhisper with Medusa heads
Python MIT License UpdatedAug 3, 2024 -
RealSI Public
Forked from byteresearchcla/RealSIRealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
Python Creative Commons Attribution 4.0 International UpdatedAug 3, 2024 -
faster-whisper Public
Forked from SYSTRAN/faster-whisperFaster Whisper transcription with CTranslate2
Python MIT License UpdatedJul 31, 2024 -
whisper.cpp Public
Forked from ggml-org/whisper.cppPort of OpenAI's Whisper model in C/C++
C++ MIT License UpdatedJul 31, 2024 -
whisperX Public
Forked from m-bain/whisperXWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Python BSD 2-Clause "Simplified" License UpdatedJul 11, 2024 -
whisper-jni Public
Forked from GiviMAD/whisper-jniA JNI wrapper for using whisper.cpp, allows to transcribe speech to text in Java.
Java Apache License 2.0 UpdatedJul 3, 2024 -
SD-Eval Public
Forked from amphionspace/SD-EvalSD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
Python Apache License 2.0 UpdatedJun 25, 2024 -
GLM-4 Public
Forked from zai-org/GLM-4GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Python Apache License 2.0 UpdatedJun 5, 2024 -
-
3D-Speaker Public
Forked from modelscope/3D-SpeakerA Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Python Apache License 2.0 UpdatedMay 30, 2024 -
k2-v2.0-pre-branch-HLG Public
Forked from k2-fsa/k2FSA/FST algorithms, differentiable, with PyTorch compatibility.
Cuda Apache License 2.0 UpdatedMay 17, 2024 -
attention-is-all-you-need-pytorch Public
Forked from jadore801120/attention-is-all-you-need-pytorchA PyTorch implementation of the Transformer model in "Attention is All You Need".
Python MIT License UpdatedApr 16, 2024 -
kaldi-native-fbank Public
Forked from csukuangfj/kaldi-native-fbankKaldi-compatible online fbank extractor without external dependencies
C++ Apache License 2.0 UpdatedApr 14, 2024 -
KNN-CTC Public
Forked from NKU-HLT/KNN-CTC[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
C++ Apache License 2.0 UpdatedMar 20, 2024 -
whisper-plus Public
Forked from kadirnar/whisper-plusWhisperPlus: Advancing Speech-to-Text Processing 🚀
-
riva-asrlib-decoder Public
Forked from nvidia-riva/riva-asrlib-decoderStandalone implementation of the CUDA-accelerated WFST Decoder available in Riva
Python UpdatedDec 20, 2023