Skip to content
View huckiyang's full-sized avatar
💮
love life. live life.
💮
love life. live life.

Highlights

  • Pro

Block or report huckiyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

JAX implementation of configurable LLM distillation training

Python 24 4 Updated Nov 15, 2025

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 7,345 632 Updated Jun 6, 2026

JAX backend for SGL

Python 281 104 Updated Jun 13, 2026

Gemma open-weight LLM library, from Google DeepMind

Python 5,410 953 Updated Jun 12, 2026

Run LLMs with MLX

Python 5,850 768 Updated Jun 12, 2026

MLX: An array framework for Apple silicon

C++ 26,959 1,902 Updated Jun 13, 2026

A Lightweight LLM Post-Training Library

Python 2,336 307 Updated Jun 13, 2026
Python 2 Updated Apr 8, 2026

A general purpose scientific writer

Python 1,936 232 Updated Jun 10, 2026

Kosmos: An AI Scientist for Autonomous Discovery - An implementation and adaptation to be driven by Claude Code or API - Based on the Kosmos AI Paper - https://arxiv.org/abs/2511.02824

Python 539 96 Updated Apr 4, 2026

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 672 52 Updated Feb 26, 2026
Python 7 3 Updated Nov 17, 2025

Accepted by ACL26 Main, Oral

Python 43 Updated May 19, 2026

[ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

Python 47 3 Updated Apr 21, 2026

LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 1,177 187 Updated Jun 12, 2026

GPT-4o-level, real-time spoken dialogue system.

Python 377 33 Updated Jan 27, 2025
Python 4 1 Updated Mar 25, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 21,633 2,494 Updated May 25, 2026

Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)

Python 17 Updated Nov 14, 2024

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

C 8,439 456 Updated Jun 2, 2026

Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"

HTML 127 10 Updated Jul 15, 2025

[EMNLP'24] Code and data for paper "Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level"

Python 23 4 Updated Jun 29, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 10,393 970 Updated May 16, 2026

PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024)

TypeScript 113 61 Updated Feb 17, 2026

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

Jupyter Notebook 41 1 Updated Jan 6, 2024
15 2 Updated Jul 4, 2024

Chronos: Pretrained Models for Time Series Forecasting

Python 5,455 650 Updated Jun 12, 2026

toLLMatch🔪: Context-aware LLM-based simultaneous translation

Jupyter Notebook 10 3 Updated Mar 6, 2025
Next