kingfener

king kingfener

a man open the new world

2 followers · 3 following

Beijing

Achievements

Stars

mit-han-lab / vlash

Real-Time VLAs via Future-state-aware Asynchronous Inference.

Python 242 10 Updated Dec 21, 2025

magic-research / piecewise-rectified-flow

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Jupyter Notebook 529 31 Updated Sep 8, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,834 501 Updated Dec 5, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,864 4,112 Updated Dec 23, 2025

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,763 1,179 Updated Sep 26, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,436 1,689 Updated Sep 24, 2025

stefantaubert / pinyin-to-ipa

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

Python 53 10 Updated Apr 16, 2025

wenet-e2e / wesep

Target Speaker Extraction Toolkit

Python 233 32 Updated Oct 4, 2025

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,478 416 Updated Apr 20, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,646 749 Updated Sep 22, 2025

AI-Hobbyist / Genshin_Datasets

Genshin Datasets For SVC/SVS/TTS

701 40 Updated Jul 27, 2025

scateu / tsv_edl.vim

video editing with vim/spreadsheet/sed/python. methodology inspired by BBC digital paper edit. "Excel-dit"

Python 97 5 Updated Jul 30, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,069 3,204 Updated Dec 19, 2025

CPJKU / asap-dataset

Forked from fosfrancesco/asap-dataset

A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.

Jupyter Notebook 35 4 Updated Jul 31, 2025

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,729 1,168 Updated Nov 14, 2024

Audio-AGI / AudioSep

Official implementation of "Separate Anything You Describe"

Python 1,855 140 Updated Nov 26, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,046 6,639 Updated Sep 30, 2025

Visualize-ML / Book7_Visualizations-for-Machine-Learning

Book_7_《机器学习》 | 鸢尾花书：从加减乘除到机器学习；欢迎批评指正

Jupyter Notebook 3,126 583 Updated Dec 10, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,982 5,864 Updated Aug 16, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,865 344 Updated Jan 4, 2024

cnlinxi / book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

TeX 613 80 Updated Apr 19, 2022

wenet-e2e / speech-recognition-papers

Towards hot directions in industrial end to end speech recognition

331 40 Updated Nov 30, 2021

wenet-e2e / speech-synthesis-paper

List of speech synthesis papers.

1,061 124 Updated Jul 24, 2023

keonlee9420 / Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Python 191 44 Updated Nov 18, 2021

NVIDIA / mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 864 185 Updated Jul 22, 2023

athena-team / athena

an open-source implementation of sequence-to-sequence based speech processing engine

C++ 965 201 Updated Dec 2, 2022

labuladong / fucking-algorithm

刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.

Markdown 131,400 23,616 Updated Oct 8, 2025

nobody132 / masr

中文语音识别; Mandarin Automatic Speech Recognition;

Python 1,961 482 Updated Jul 25, 2024

srvk / eesen

The official repository of the Eesen project

C++ 831 340 Updated May 23, 2019

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,655 2,364 Updated Dec 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

king kingfener

Achievements

Achievements

Block or report kingfener

Stars

mit-han-lab / vlash

magic-research / piecewise-rectified-flow

canopyai / Orpheus-TTS

unslothai / unsloth

QwenLM / Qwen-Agent

OpenBMB / MiniCPM-V

stefantaubert / pinyin-to-ipa

wenet-e2e / wesep

Plachtaa / seed-vc

OpenGVLab / InternVL

AI-Hobbyist / Genshin_Datasets

scateu / tsv_edl.vim

gradio-app / gradio

CPJKU / asap-dataset

facebookresearch / seamless_communication

Audio-AGI / AudioSep

facebookresearch / fairseq

Visualize-ML / Book7_Visualizations-for-Machine-Learning

coqui-ai / TTS

facebookresearch / encodec

cnlinxi / book-text-to-speech

wenet-e2e / speech-recognition-papers

wenet-e2e / speech-synthesis-paper

keonlee9420 / Parallel-Tacotron2

NVIDIA / mellotron

athena-team / athena

labuladong / fucking-algorithm

nobody132 / masr

srvk / eesen

espnet / espnet