Skip to content
View chxliou's full-sized avatar

Highlights

  • Pro

Block or report chxliou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

50 results for source starred repositories written in Python
Clear filter

🙌 OpenHands: Code Less, Make More

Python 64,768 7,868 Updated Nov 7, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,446 11,110 Updated Nov 7, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,034 7,498 Updated Nov 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,020 3,931 Updated Nov 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,217 2,442 Updated Nov 7, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,004 299 Updated Nov 3, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,263 420 Updated Nov 7, 2025

Fully open data curation for reasoning models

Python 2,136 177 Updated Sep 3, 2025

Official Repo for Open-Reasoner-Zero

Python 2,059 119 Updated Jun 2, 2025

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,619 162 Updated Oct 28, 2024

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,321 140 Updated Aug 12, 2025

A Python package for causal inference in quasi-experimental settings

Python 1,057 85 Updated Nov 7, 2025

Training Sparse Autoencoders on Language Models

Python 1,035 200 Updated Nov 2, 2025

Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input

Python 910 63 Updated Jun 8, 2025

欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Python 894 82 Updated Oct 28, 2025

Pretraining and inference code for a large-scale depth-recurrent language model

Python 842 72 Updated Oct 16, 2025

Chemcrow

Python 833 130 Updated Dec 19, 2024

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 826 91 Updated Oct 13, 2025

Tool for data extraction and interacting with Lean programmatically.

Python 721 114 Updated Sep 13, 2025

Verify Precision of all Kimi K2 API Vendor

Python 334 16 Updated Oct 27, 2025

Tina: Tiny Reasoning Models via LoRA

Python 304 37 Updated Sep 23, 2025

Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.

Python 283 18 Updated Oct 18, 2025

[TMLR 2025] Efficient Reasoning Models: A Survey

Python 275 18 Updated Oct 30, 2025

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 268 24 Updated Oct 16, 2025

Automated Hypothesis Testing with Agentic Sequential Falsifications

Python 230 24 Updated May 14, 2025

A language agent gym with challenging scientific tasks

Python 210 25 Updated Nov 6, 2025

TART: A plug-and-play Transformer module for task-agnostic reasoning

Python 200 15 Updated Jun 22, 2023

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

Python 196 23 Updated Oct 21, 2025
Next