npuichigo

Follow

🎹

Focusing

Yuchao Zhang npuichigo

🎹

Focusing

Follow

speech synthesis/machine learning c++/rust/python

320 followers · 223 following

Speechify

Achievements

Achievements

Lists (1)

Sort

🔮 Future ideas

Starred repositories

sgl-project / mini-sglang

Python 1,172 84 Updated Dec 18, 2025

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 1,653 120 Updated Dec 18, 2025

KdaiP / DC-Speech-VAE

5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs

Python 55 9 Updated Nov 19, 2025

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,197 266 Updated Dec 16, 2025

zai-org / GLM-TTS

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 733 87 Updated Dec 17, 2025

cfregly / ai-performance-engineering

Python 791 106 Updated Dec 19, 2025

NVIDIA / TileGym

Helpful kernel tutorials and examples for tile-based GPU programming

Python 454 22 Updated Dec 18, 2025

dsl-learn / cutile-learn

NVIDIA cuTile learn

Python 128 Updated Dec 9, 2025

huggingface / skills

Python 622 66 Updated Dec 15, 2025

NVIDIA / cutile-python

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,623 83 Updated Dec 19, 2025

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,006 118 Updated Dec 3, 2025

nahratzah / senders_receivers

senders/receivers implementation in rust

Rust 2 Updated Aug 11, 2024

archinetai / cqt-pytorch

An invertible and differentiable implementation of the Constant-Q Transform (CQT).

Python 69 4 Updated Dec 9, 2022

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 992 135 Updated Dec 19, 2025

dinhoitt / BemaGANv2

Python 19 3 Updated Jun 3, 2025

maikel / tutorial_stdexec

Another Tutorial on std::execution

C++ 11 Updated Nov 26, 2024

facebookresearch / FlowDec

An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.

Python 193 17 Updated Jul 14, 2025

Tencent-Hunyuan / flex-block-attn

flex-block-attn: an efficient block sparse attention computation library

Jupyter Notebook 96 6 Updated Nov 24, 2025

fallenshock / FlowEdit

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 890 41 Updated Dec 18, 2025

facebookresearch / omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,482 213 Updated Dec 16, 2025

OHF-Voice / piper1-gpl

Fast and local neural text-to-speech engine

C++ 2,069 212 Updated Nov 12, 2025

SeunggeunKimkr / PRISM

Public repository for fine-tuning Masked Diffusion Models toward provable self-correction.

Python 19 1 Updated Nov 10, 2025

NVlabs / Fast-dLLM

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 741 70 Updated Nov 28, 2025

ZHZisZZ / dllm

dLLM: Simple Diffusion Language Modeling

Python 1,429 144 Updated Dec 18, 2025

cocoindex-io / cocoindex

Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!

Rust 4,004 324 Updated Dec 18, 2025

ASLP-lab / DiffRhythm2

Forked from xiaomi-research/diffrhythm2

Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching

Python 129 4 Updated Nov 9, 2025

shaochenze / calm

Official implementation of "Continuous Autoregressive Language Models"

Python 672 80 Updated Dec 1, 2025

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 443 24 Updated Dec 15, 2025

harvard-edge / cs249r_book

Introduction to Machine Learning Systems

JavaScript 11,000 1,232 Updated Dec 18, 2025

kuleshov-group / e2d2

[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference

Python 31 3 Updated Oct 29, 2025

Starred topics

neural-vocoder

openfst

onnx