xingchensong

🐢

slow working

Xingchen Song(宋星辰) xingchensong

🐢

slow working

Deaf | I like building tools~

540 followers · 591 following

Tsinghua University (2019-2022), WeNet Community (2021-now)
Beijing, China
19:53 (UTC +08:00)
xingchensong.github.io
https://blog.csdn.net/zongza
https://scholar.google.com/citations?user=65eIdn4AAAAJ&hl=zh-CN

Achievements

x3 x2 x3

Achievements

x3 x2 x3

Highlights

Organizations

Lists (22)

Sort

Stars

ziye26 / Audio-Oscar

Audio-Oscar is a multi-agent framework for generating long-form, controllable audio from complex audio scene descriptions.

Python 41 4 Updated Jun 8, 2026

bovod-sjtu / HoliTok

HoliTok:A Coutinuous Holistic Tokenization with Robust Dual Capabilities of Speech Generation and Understanding

Python 28 1 Updated Jun 8, 2026

rednote-hilab / dots.tts

Python 495 35 Updated Jun 12, 2026

Soul-AILab / SoulX-Transcriber

An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

Python 244 10 Updated Jun 4, 2026

Blinorot / utmos-pytorch

Unofficial fairseq-free PyTorch implementation of UTMOS (v1, 2022), matching the original system.

Python 33 1 Updated Jun 6, 2026

Stability-AI / stable-audio-3

Python 480 56 Updated Jun 9, 2026

nexu-io / open-design

🎨 Local-first, open-source Claude Design alternative. 🖥️ Native desktop app. ⚡ 259+ Skills · ✨ 142+ Design Systems 🖼️ Web · desktop · mobile prototypes · slides · images · videos · HyperFrames 📦 Sa…

TypeScript 64,171 7,161 Updated Jun 13, 2026

wx9Songs / MOSS-Music-Data-Pipeline

Python 36 4 Updated Apr 26, 2026

OpenMOSS / MOSS-Music

MOSS-Music is an open-source music understanding model for targeting musical captioning, lyrics ASR, structural analysis, chord / key / tempo reasoning, and long-form musical question answering.

Python 92 6 Updated May 9, 2026

yukara-ikemiya / friendly-stable-audio-tools

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 220 16 Updated Jul 25, 2024

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 3,773 468 Updated May 26, 2026

redai-infra / Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 423 45 Updated Jun 13, 2026

OpenMOSS / MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

Python 3,479 448 Updated Jun 2, 2026

KKKKhazix / khazix-skills

数字生命卡兹克开源的 AI Skills 合集

Python 14,651 1,789 Updated Jun 4, 2026

yzlnew / infra-skills

A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-performance systems.

Python 134 9 Updated May 22, 2026

shangguanqituan / spksim

A batch scoring tool for speaker similarity evaluation.

Python 6 1 Updated Dec 17, 2025

SonyResearch / Woosh

Public release of the Sound Effect Foundation model by Sony AI.

Python 316 22 Updated May 21, 2026

VibingJustSpeakIt / Vibing

HTML 497 49 Updated Apr 27, 2026

fzyzcjy / torch_utils

Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocations such as NCCL, ...)

Python 110 8 Updated Sep 11, 2025