huckiyang

💮

love life. live life.

Speech, Alignments, Robust LMs

129 followers · 88 following

Sr. Staff Member, Apple
12:55 (UTC -07:00)
huckiyang.github.io/
@huckiyang
channel/UCSj3hCBIds5BpyO7A4F3l7A

Lists (2)

🚀 My stack

1 repository

quantum-circuit-learning

1 repository

Stars

zlab-princeton / llm-distillation-jax

JAX implementation of configurable LLM distillation training

Python 24 4 Updated Nov 15, 2025

Blaizzy / mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 7,345 632 Updated Jun 6, 2026

sgl-project / sglang-jax

JAX backend for SGL

Python 281 104 Updated Jun 13, 2026

google-deepmind / gemma

Gemma open-weight LLM library, from Google DeepMind

Python 5,410 953 Updated Jun 12, 2026

ml-explore / mlx-lm

Run LLMs with MLX

Python 5,850 768 Updated Jun 12, 2026

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 26,959 1,902 Updated Jun 13, 2026

google / tunix

A Lightweight LLM Post-Training Library

Python 2,336 307 Updated Jun 13, 2026

N-Orien / CoVoGER

Python 2 Updated Apr 8, 2026

K-Dense-AI / claude-scientific-writer

A general purpose scientific writer

Python 1,936 232 Updated Jun 10, 2026

jimmc414 / Kosmos

Kosmos: An AI Scientist for Autonomous Discovery - An implementation and adaptation to be driven by Claude Code or API - Based on the Kosmos AI Paper - https://arxiv.org/abs/2511.02824

Python 539 96 Updated Apr 4, 2026

NVlabs / OmniVinci

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 672 52 Updated Feb 26, 2026

CYLphysics / QPA

Python 7 3 Updated Nov 17, 2025

YukinoWan / Speech-Hands

Accepted by ACL26 Main, Oral

Python 43 Updated May 19, 2026

NVlabs / EoRA

[ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

Python 47 3 Updated Apr 21, 2026

huckiyang / AudioLMs-Descriptive-Speech-Quality-Evaluators

ICLR 2025

Python 5 1 Updated Jun 2, 2025

ModelCloud / GPTQModel

LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 1,177 187 Updated Jun 12, 2026

OpenMOSS / SpeechGPT-2.0-preview

GPT-4o-level, real-time spoken dialogue system.

Python 377 33 Updated Jan 27, 2025

YukinoWan / SpeechIQ

Python 4 1 Updated Mar 25, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 21,633 2,494 Updated May 25, 2026

yichen14 / FastAdaSP

Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)

Python 17 Updated Nov 14, 2024

moonshine-ai / moonshine

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

C 8,439 456 Updated Jun 2, 2026

kehanlu / DeSTA2

Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"

HTML 127 10 Updated Jul 15, 2025

fzp0424 / MT-Ladder

[EMNLP'24] Code and data for paper "Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level"

Python 23 4 Updated Jun 29, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 10,393 970 Updated May 16, 2026

ruiyiw / patient-psi

PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024)

TypeScript 113 61 Updated Feb 17, 2026

slSeanWU / beats-conformer-bart-audio-captioner

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

Jupyter Notebook 41 1 Updated Jan 6, 2024

rithiksachdev / PostASR-Correction-SLT2024

Python 18 2 Updated Jul 22, 2024

tzyll / ChineseHP

15 2 Updated Jul 4, 2024

amazon-science / chronos-forecasting

Chronos: Pretrained Models for Time Series Forecasting

Python 5,455 650 Updated Jun 12, 2026

RomanKoshkin / toLLMatch

toLLMatch🔪: Context-aware LLM-based simultaneous translation

Jupyter Notebook 10 3 Updated Mar 6, 2025