Skip to content
View HBX-hbx's full-sized avatar

Highlights

  • Pro

Block or report HBX-hbx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
43 results for source starred repositories
Clear filter

Official repository for the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe"

Python 58 Updated Apr 15, 2026

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 1,048 81 Updated Apr 15, 2026

[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Python 261 12 Updated Mar 11, 2026

The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents"

Python 30 Updated Apr 9, 2026

Awesome Open-ended AI

426 44 Updated Apr 16, 2026

The official repository for the dataset FactualBench, which is introduced in paper "Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization".

Python 3 Updated Dec 30, 2025

My learning notes for ML SYS.

Python 6,018 394 Updated Apr 8, 2026

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,820 568 Updated Feb 11, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,433 129 Updated Nov 9, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,844 111 Updated Mar 18, 2025

This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box

Python 18 1 Updated Dec 19, 2024

A large-scale, fine-grained, diverse preference dataset (and models).

Python 367 16 Updated Dec 29, 2023

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,651 1,126 Updated Apr 9, 2026

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,906 368 Updated Dec 17, 2025

Code for the paper "The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning"

Python 5 Updated Apr 8, 2025
Python 1,345 54 Updated Nov 21, 2024

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 62 7 Updated Feb 20, 2024

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

8,109 492 Updated Sep 12, 2025

LaTeX Thesis Template for Tsinghua University

TeX 5,255 1,143 Updated Apr 4, 2026

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 531 51 Updated Jan 11, 2024

Linux内核源码分析

1,637 356 Updated Sep 5, 2023

Chrome Extensions Samples

JavaScript 17,468 9,018 Updated Apr 16, 2026

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

Python 204 27 Updated Apr 10, 2023

你管这破玩意叫操作系统源码 — 像小说一样品读 Linux 0.11 核心代码

HTML 22,241 2,932 Updated Mar 22, 2025

What if you need more exercises?

Python 33 5 Updated Jul 16, 2024

什么?你敢放心的把后背交给 AI? 我赌你不敢,那就来学学 AI 时代最安全的语言吧(Python无法战胜!)。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容,这可能是目前最用心的 Rust 中文学习教程 / Book

Rust 30,257 2,570 Updated Mar 12, 2026

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 99,366 10,898 Updated Apr 12, 2026
Next