npuichigo
🎹 Focusing
  • Speechify

Starred repositories

Python 1,172 84 Updated Dec 18, 2025

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 1,653 120 Updated Dec 18, 2025

5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs

Python 55 9 Updated Nov 19, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,197 266 Updated Dec 16, 2025

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 733 87 Updated Dec 17, 2025

Helpful kernel tutorials and examples for tile-based GPU programming

Python 454 22 Updated Dec 18, 2025

NVIDIA cuTile learn

Python 128 Updated Dec 9, 2025
Python 622 66 Updated Dec 15, 2025

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,623 83 Updated Dec 19, 2025

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,006 118 Updated Dec 3, 2025

Senders/receivers implementation in Rust

Rust 2 Updated Aug 11, 2024

An invertible and differentiable implementation of the Constant-Q Transform (CQT).

Python 69 4 Updated Dec 9, 2022

A framework for efficient model inference with omni-modality models

Python 992 135 Updated Dec 19, 2025
Python 19 3 Updated Jun 3, 2025

Another Tutorial on std::execution

C++ 11 Updated Nov 26, 2024

A neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kbps or 4.5 kbps.

Python 193 17 Updated Jul 14, 2025

flex-block-attn: an efficient block sparse attention computation library

Jupyter Notebook 96 6 Updated Nov 24, 2025

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 890 41 Updated Dec 18, 2025

Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Python 2,482 213 Updated Dec 16, 2025

Fast and local neural text-to-speech engine

C++ 2,069 212 Updated Nov 12, 2025

Public repository for fine-tuning Masked Diffusion Models toward provable self-correction.

Python 19 1 Updated Nov 10, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 741 70 Updated Nov 28, 2025

dLLM: Simple Diffusion Language Modeling

Python 1,429 144 Updated Dec 18, 2025

Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!

Rust 4,004 324 Updated Dec 18, 2025

Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching

Python 129 4 Updated Nov 9, 2025

Official implementation of "Continuous Autoregressive Language Models"

Python 672 80 Updated Dec 1, 2025

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 443 24 Updated Dec 15, 2025

Introduction to Machine Learning Systems

JavaScript 11,000 1,232 Updated Dec 18, 2025

[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference

Python 31 3 Updated Oct 29, 2025