Skip to content
View kingfener's full-sized avatar
  • Beijing

Block or report kingfener

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Real-Time VLAs via Future-state-aware Asynchronous Inference.

Python 242 10 Updated Dec 21, 2025

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Jupyter Notebook 529 31 Updated Sep 8, 2025

Towards Human-Sounding Speech

Python 5,834 501 Updated Dec 5, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,864 4,112 Updated Dec 23, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,763 1,179 Updated Sep 26, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,436 1,689 Updated Sep 24, 2025

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

Python 53 10 Updated Apr 16, 2025

Target Speaker Extraction Toolkit

Python 233 32 Updated Oct 4, 2025

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,478 416 Updated Apr 20, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,646 749 Updated Sep 22, 2025

Genshin Datasets For SVC/SVS/TTS

701 40 Updated Jul 27, 2025

video editing with vim/spreadsheet/sed/python. methodology inspired by BBC digital paper edit. "Excel-dit"

Python 97 5 Updated Jul 30, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,069 3,204 Updated Dec 19, 2025

A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.

Jupyter Notebook 35 4 Updated Jul 31, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,729 1,168 Updated Nov 14, 2024

Official implementation of "Separate Anything You Describe"

Python 1,855 140 Updated Nov 26, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,046 6,639 Updated Sep 30, 2025

Book_7_《机器学习》 | 鸢尾花书:从加减乘除到机器学习;欢迎批评指正

Jupyter Notebook 3,126 583 Updated Dec 10, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,982 5,864 Updated Aug 16, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,865 344 Updated Jan 4, 2024

A book about Text-to-Speech (TTS) in Chinese.

TeX 613 80 Updated Apr 19, 2022

Towards hot directions in industrial end to end speech recognition

331 40 Updated Nov 30, 2021

List of speech synthesis papers.

1,061 124 Updated Jul 24, 2023

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Python 191 44 Updated Nov 18, 2021

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 864 185 Updated Jul 22, 2023

an open-source implementation of sequence-to-sequence based speech processing engine

C++ 965 201 Updated Dec 2, 2022

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Markdown 131,400 23,616 Updated Oct 8, 2025

中文语音识别; Mandarin Automatic Speech Recognition;

Python 1,961 482 Updated Jul 25, 2024

The official repository of the Eesen project

C++ 831 340 Updated May 23, 2019

End-to-End Speech Processing Toolkit

Python 9,655 2,364 Updated Dec 16, 2025
Next