Starred repositories

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,345 11,083 Updated Nov 6, 2025

🙌 OpenHands: Code Less, Make More

Python 64,750 7,871 Updated Nov 6, 2025

Repository for the Tetrad Project, www.phil.cmu.edu/tetrad.

Java 430 116 Updated Nov 6, 2025

A language agent gym with challenging scientific tasks

Python 209 25 Updated Nov 6, 2025

Framework enabling modular interchange of language agents, environments, and optimizers

Python 110 15 Updated Nov 6, 2025

A Python package for causal inference in quasi-experimental settings

Python 1,057 85 Updated Nov 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,988 3,927 Updated Nov 6, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,260 420 Updated Nov 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,969 7,494 Updated Nov 6, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,179 2,438 Updated Nov 6, 2025

📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.

266 6 Updated Nov 5, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,679 440 Updated Nov 4, 2025

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

HTML 158 15 Updated Nov 3, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,996 299 Updated Nov 3, 2025

Training Sparse Autoencoders on Language Models

Python 1,031 200 Updated Nov 2, 2025

Now, Stronger AI Pushes Frontiers, Stronger Our Shared Future.

216 17 Updated Oct 31, 2025

Python package for Causal Discovery by learning the graphical structure of Bayesian networks. Structure Learning, Parameter Learning, Inferences, Sampling methods.

Jupyter Notebook 555 53 Updated Oct 31, 2025

[TMLR 2025] Efficient Reasoning Models: A Survey

Python 275 18 Updated Oct 30, 2025

The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"

Python 188 28 Updated Oct 29, 2025

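Welcome to LLM-Dojo, an open-source place for learning about large language models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, GLM, and others), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓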

Python 894 82 Updated Oct 28, 2025

Verify Precision of all Kimi K2 API Vendor

Python 325 16 Updated Oct 27, 2025

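High-quality, stable API access to OpenAI, Gemini, Claude, and others, for enterprises and developers. An OpenAI API proxy that supports ChatGPT API calls, the official Anthropic Claude interface format, and the official Google Gemini interface format; supports gpt-5 and sora. No OpenAI key, no purchased OpenAI account, and no US-dollar bank card required: just call it directly. Stable and easy to use! 智增增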

PHP 843 75 Updated Oct 24, 2025

A Comprehensive Survey on World Models for Embodied AI

97 5 Updated Oct 23, 2025

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

Python 196 23 Updated Oct 21, 2025

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

680 33 Updated Oct 20, 2025

Obsidian tars plugin that supports text generation based on tag suggestions, using services like DeepSeek, Claude, OpenAI, OpenRouter, SiliconFlow, Gemini, Ollama, Kimi, Doubao, Qwen, Zhipu, QianFa…

TypeScript 199 21 Updated Oct 20, 2025

Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.

Python 283 18 Updated Oct 18, 2025

Pretraining and inference code for a large-scale depth-recurrent language model

Python 842 72 Updated Oct 16, 2025

Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"

Python 268 24 Updated Oct 16, 2025

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 825 91 Updated Oct 13, 2025