xingchensong
🐢 slow working

Organizations

@thuhcsi @SpeechXC @wenet-e2e

Audio-Oscar is a multi-agent framework for generating long-form, controllable audio from complex audio scene descriptions.

Python 41 4 Updated Jun 8, 2026

HoliTok: A Continuous Holistic Tokenization with Robust Dual Capabilities of Speech Generation and Understanding

Python 28 1 Updated Jun 8, 2026
Python 495 35 Updated Jun 12, 2026

An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

Python 244 10 Updated Jun 4, 2026

Unofficial fairseq-free PyTorch implementation of UTMOS (v1, 2022), matching the original system.

Python 33 1 Updated Jun 6, 2026

🎨 Local-first, open-source Claude Design alternative. 🖥️ Native desktop app. ⚡ 259+ Skills · ✨ 142+ Design Systems 🖼️ Web · desktop · mobile prototypes · slides · images · videos · HyperFrames 📦 Sa…

TypeScript 64,171 7,161 Updated Jun 13, 2026

MOSS-Music is an open-source music understanding model targeting music captioning, lyrics ASR, structural analysis, chord / key / tempo reasoning, and long-form musical question answering.

Python 92 6 Updated May 9, 2026

Refactored / updated version of `stable-audio-tools`, the open-source codebase for audio/music generative models originally released by Stability AI.

Python 220 16 Updated Jul 25, 2024

Generative models for conditional audio generation

Python 3,773 468 Updated May 26, 2026

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 423 45 Updated Jun 13, 2026

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

Python 3,479 448 Updated Jun 2, 2026

A collection of AI Skills open-sourced by 数字生命卡兹克

Python 14,651 1,789 Updated Jun 4, 2026

A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-performance systems.

Python 134 9 Updated May 22, 2026

A batch scoring tool for speaker similarity evaluation.

Python 6 1 Updated Dec 17, 2025

Public release of the Sound Effect Foundation model by Sony AI.

Python 316 22 Updated May 21, 2026

Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocations such as NCCL, ...)

Python 110 8 Updated Sep 11, 2025

My set of tools for SGLang development

Python 9 Updated Apr 9, 2026

OmX - Oh My codeX: Your codex is not alone. Add hooks, agent teams, HUDs, and so much more.

TypeScript 30,857 2,427 Updated Jun 13, 2026

From Early Internet Design Patterns to AI Agent Implementation — A Deep Dive into Claude Code for Developers

JavaScript 831 277 Updated Apr 15, 2026

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 1,406 141 Updated Mar 19, 2026

Agentic Kernel Optimization for All — automated GPU kernel optimization for any kernel, any hardware, any language

Python 285 19 Updated May 31, 2026

Find slow PyTorch training bottlenecks: DataLoader stalls, low GPU utilization, rank stragglers, memory creep, and run regressions.

Python 172 16 Updated Jun 13, 2026

A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.

Python 314 23 Updated May 31, 2026
JavaScript 13 Updated Apr 10, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 488 205 Updated Jun 13, 2026

Pure Rust + CUDA LLM inference engine

Rust 383 46 Updated Jun 13, 2026