Starred repositories

57 stars written in Python

🙌 OpenHands: Code Less, Make More

Python 64,733 7,870 Updated Nov 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,259 11,065 Updated Nov 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,931 7,492 Updated Nov 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,961 3,923 Updated Nov 6, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,163 2,435 Updated Nov 6, 2025

An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning.

Python 14,290 1,836 Updated Jul 3, 2024

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,993 298 Updated Nov 3, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,258 419 Updated Nov 3, 2025

Fully open data curation for reasoning models

Python 2,134 176 Updated Sep 3, 2025

Official Repo for Open-Reasoner-Zero

Python 2,059 119 Updated Jun 2, 2025

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,618 161 Updated Oct 28, 2024

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,320 139 Updated Aug 12, 2025

A Python package for causal inference in quasi-experimental settings

Python 1,057 85 Updated Nov 6, 2025

Training Sparse Autoencoders on Language Models

Python 1,028 199 Updated Nov 2, 2025

Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input

Python 910 63 Updated Jun 8, 2025

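Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓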

Python 894 82 Updated Oct 28, 2025

Pretraining and inference code for a large-scale depth-recurrent language model

Python 842 72 Updated Oct 16, 2025

Chemcrow

Python 831 130 Updated Dec 19, 2024

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 824 91 Updated Oct 13, 2025

Tool for data extraction and interacting with Lean programmatically.

Python 720 114 Updated Sep 13, 2025

Verify the precision of all Kimi K2 API vendors

Python 324 16 Updated Oct 27, 2025

Tina: Tiny Reasoning Models via LoRA

Python 304 37 Updated Sep 23, 2025

Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.

Python 283 18 Updated Oct 18, 2025

[TMLR 2025] Efficient Reasoning Models: A Survey

Python 275 18 Updated Oct 30, 2025

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 268 24 Updated Oct 16, 2025

Automated Hypothesis Testing with Agentic Sequential Falsifications

Python 230 24 Updated May 14, 2025

A language agent gym with challenging scientific tasks

Python 208 25 Updated Nov 5, 2025

TART: A plug-and-play Transformer module for task-agnostic reasoning

Python 200 15 Updated Jun 22, 2023