Skip to content
View cliang1453's full-sized avatar

Block or report cliang1453

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
112 results for source starred repositories
Clear filter

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 452 16 Updated May 13, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,501 279 Updated Feb 5, 2026

Esoteric Language Models

Python 110 15 Updated Nov 24, 2025

Lightweight coding agent that runs in your terminal

Rust 59,080 7,717 Updated Feb 5, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,010 3,191 Updated Feb 5, 2026

Muon is Scalable for LLM Training

1,423 82 Updated Aug 3, 2025
Python 17 1 Updated Feb 2, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,962 289 Updated May 15, 2025

Fully open reproduction of DeepSeek-R1

Python 25,858 2,413 Updated Nov 24, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,266 105 Updated Jan 19, 2026

A bibliography and survey of the papers surrounding o1

TeX 1,212 51 Updated Nov 16, 2024

NanoGPT (124M) in 2 minutes

Python 4,583 607 Updated Feb 1, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,331 4,335 Updated Feb 5, 2026

A family of compressed models obtained via pruning and knowledge distillation

364 18 Updated Nov 6, 2025

Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"

Python 503 42 Updated Mar 7, 2023

Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair

Jupyter Notebook 52 8 Updated Jan 29, 2024

Development repository for the Triton language and compiler

MLIR 18,357 2,553 Updated Feb 5, 2026
Python 235 23 Updated Jun 11, 2024

A framework for few-shot evaluation of language models.

Python 11,366 3,018 Updated Feb 5, 2026

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,224 3,663 Updated Jul 4, 2024
Python 1,559 159 Updated Feb 5, 2026

Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.

Python 183 28 Updated Oct 28, 2022

Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

Python 465 53 Updated Nov 5, 2022

Toolkit for creating, sharing and using natural language prompts.

Python 2,997 378 Updated Oct 23, 2023

Paper List for In-context Learning 🌷

874 63 Updated Oct 8, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,277 367 Updated Dec 22, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,741 482 Updated Jan 8, 2024
Next