-
Tsinghua University (2019-2022), WeNet Community (2021-now)
- Beijing, China
-
01:13
(UTC +08:00) - xingchensong.github.io
- https://blog.csdn.net/zongza
- https://scholar.google.com/citations?user=65eIdn4AAAAJ&hl=zh-CN
Highlights
- Pro
-
S3Tokenizer Public
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
-
FlashCosyVoice Public
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
-
-
TouchNet Public
A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
-
-
-
torchtitan Public
Forked from pytorch/torchtitanA PyTorch native library for large model training
-
-
yt-dlp Public
Forked from yt-dlp/yt-dlpA feature-rich command-line audio/video downloader
Python The Unlicense UpdatedSep 11, 2024 -
FunASR Public
Forked from modelscope/FunASRA Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Python Other UpdatedJul 18, 2024 -
whisper.cpp Public
Forked from ggml-org/whisper.cppPort of OpenAI's Whisper model in C/C++
-
OmniQuant Public
Forked from OpenGVLab/OmniQuantOmniQuant is a simple and powerful quantization technique for LLMs.
-
vimspector Public
Forked from puremourning/vimspectorvimspector - A multi-language debugging system for Vim
Vim Script Apache License 2.0 UpdatedJan 17, 2024 -
llama.cpp Public
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
C MIT License UpdatedJan 8, 2024 -
ggml Public
Forked from ggml-org/ggmlTensor library for machine learning
C MIT License UpdatedSep 28, 2023 -
InferLLM Public
Forked from MegEngine/InferLLMa lightweight LLM model inference framework
-
wukong-robot Public
Forked from wzpan/wukong-robot🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
-
-
-
-
wenet Public
Forked from wenet-e2e/wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
C++ Apache License 2.0 UpdatedOct 31, 2021 -
-
-
Django_with_ModulesPack2 Public
Django web framework with modulespack2-Plugin
Python UpdatedMay 12, 2019 -
Speech-Transformer-plus-2DAttention Public
Forked from kaituoxu/Speech-TransformerA PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
-
Speech-Transformer-tf2.0 Public
transformer for ASR-systerm (via tensorflow2.0)
-
ASR-Wavnet Public
some ASR-system implementations (via tensorflow 1.x)
-
ModulesPack2 Public
ModulesPack version2
-