SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

Python 228 14 Updated May 18, 2025

stepfun-ai / Step-Audio

Python 4,609 373 Updated Jan 30, 2026

mli / paper-reading

深度学习经典、新论文逐段精读

32,521 2,775 Updated Mar 22, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,715 2,379 Updated Feb 4, 2026

lucadellalib / focalcodec

A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation

Jupyter Notebook 140 14 Updated Nov 30, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 670 49 Updated Jun 5, 2025

WangRongsheng / awesome-LLM-resources

🧑‍🚀 全世界最好的LLM资料总结（多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.

7,473 723 Updated Feb 3, 2026

kehanlu / DeSTA2

Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"

HTML 120 10 Updated Jul 15, 2025

brownvc / R3GAN

Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.

Python 852 45 Updated Jan 23, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 349 48 Updated Jul 21, 2025

LqNoob / Neural-Codec-and-Speech-Language-Models

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 239 13 Updated Dec 18, 2025

Stability-AI / stable-codec

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 415 29 Updated Sep 15, 2025

facebookincubator / submitit

Python 3.8+ toolbox for submitting jobs to Slurm

Python 1,573 146 Updated Jan 14, 2026

facebookresearch / muavic

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Python 400 35 Updated Sep 11, 2023

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python 67,452 8,395 Updated Feb 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chien-yu Huang cyhuang-tw

Achievements

Achievements

Highlights

Block or report cyhuang-tw

Stars

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

OpenBMB / UltraEval-Audio

LTH14 / JiT

jctian98 / espnet

JerryHoTaiwan / DeepWaveOptics

stepfun-ai / Step-Audio2

kehanlu / lulutils

kehanlu / DeSTA2.5-Audio

QwenLM / Qwen3

wavlab-speech / versa

DanielLin94144 / Full-Duplex-Bench

hacksider / Deep-Live-Cam

Anduin2017 / HowToCook

jasonppy / VoiceStar

AudioLLMs / Awesome-Audio-LLM

slp-rl / slamkit