Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 445 15 Updated May 13, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,228 251 Updated Dec 19, 2025

Esoteric Language Models

Python 109 15 Updated Nov 24, 2025

Lightweight coding agent that runs in your terminal

Rust 54,289 6,873 Updated Dec 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,612 2,853 Updated Dec 19, 2025

Muon is Scalable for LLM Training

1,384 78 Updated Aug 3, 2025
Python 14 Updated Feb 27, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,946 288 Updated May 15, 2025

Fully open reproduction of DeepSeek-R1

Python 25,737 2,405 Updated Nov 24, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,111 99 Updated Nov 23, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,214 51 Updated Nov 16, 2024

NanoGPT (124M) in 3 minutes

Python 3,969 520 Updated Dec 17, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,705 3,806 Updated Dec 19, 2025

A family of compressed models obtained via pruning and knowledge distillation

361 18 Updated Nov 6, 2025

Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"

Python 500 41 Updated Mar 7, 2023

Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair

Jupyter Notebook 52 8 Updated Jan 29, 2024

Development repository for the Triton language and compiler

MLIR 17,877 2,458 Updated Dec 19, 2025
Python 235 23 Updated Jun 11, 2024

A framework for few-shot evaluation of language models.

Python 10,973 2,909 Updated Dec 18, 2025

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,235 3,674 Updated Jul 4, 2024
Python 1,556 160 Updated Dec 16, 2025

Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.

Python 182 28 Updated Oct 28, 2022

Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

Python 463 53 Updated Nov 5, 2022

Toolkit for creating, sharing and using natural language prompts.

Python 2,983 377 Updated Oct 23, 2023

Paper List for In-context Learning 🌷

871 63 Updated Oct 8, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,228 357 Updated Dec 11, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,732 482 Updated Jan 8, 2024