Skip to content
View HBX-hbx's full-sized avatar

Highlights

  • Pro

Block or report HBX-hbx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 1,037 79 Updated Mar 11, 2026

[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Python 257 12 Updated Mar 11, 2026

The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents"

Python 30 Updated Mar 30, 2026

Awesome Open-ended AI

417 44 Updated Feb 26, 2026

The official repository for the dataset FactualBench, which is introduced in paper "Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization".

Python 3 Updated Dec 30, 2025

My learning notes for ML SYS.

Python 5,870 382 Updated Apr 3, 2026

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,774 556 Updated Feb 11, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,413 128 Updated Nov 9, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,838 108 Updated Mar 18, 2025

This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box

Python 18 1 Updated Dec 19, 2024

A large-scale, fine-grained, diverse preference dataset (and models).

Python 366 15 Updated Dec 29, 2023

A bibliography and survey of the papers surrounding o1

TeX 1,214 51 Updated Nov 16, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,563 1,120 Updated Apr 3, 2026

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,904 369 Updated Dec 17, 2025

Code for the paper "The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning"

Python 5 Updated Apr 8, 2025
Python 1,346 53 Updated Nov 21, 2024

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 62 7 Updated Feb 20, 2024

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

8,098 491 Updated Sep 12, 2025

LaTeX Thesis Template for Tsinghua University

TeX 5,243 1,144 Updated Apr 4, 2026

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 532 52 Updated Jan 11, 2024

Linux内核源码分析

1,637 356 Updated Sep 5, 2023

Chrome Extensions Samples

JavaScript 17,441 9,001 Updated Mar 27, 2026

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

Python 202 27 Updated Apr 10, 2023

你管这破玩意叫操作系统源码 — 像小说一样品读 Linux 0.11 核心代码

HTML 22,220 2,930 Updated Mar 22, 2025

What if you need more exercises?

Python 33 5 Updated Jul 16, 2024

什么?你敢放心的把后背交给 AI? 我赌你不敢,那就来学学 AI 时代最安全的语言吧(Python无法战胜!)。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容,这可能是目前最用心的 Rust 中文学习教程 / Book

Rust 30,201 2,566 Updated Mar 12, 2026

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 99,164 10,891 Updated Mar 21, 2026

计算机自学指南

HTML 72,142 7,856 Updated Feb 24, 2026
Next