He Bingxiang HBX-hbx

Second-year PhD candidate at Tsinghua University @thunlp

41 followers · 16 following

Tsinghua University
Beijing, China
https://hbx-hbx.github.io/
@hbx_hbx

Highlights

Pro

Stars

43 results for source starred repositories

thunlp / OPD

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python 187 5 Updated Apr 29, 2026

PRIME-RL / TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 1,059 82 Updated Apr 15, 2026

thunlp / JustRL

[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Python 270 12 Updated Apr 18, 2026

JiayuJeff / CostBench

The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents"

Python 30 Updated Apr 9, 2026

jennyzzt / awesome-open-ended

Awesome Open-ended AI

430 44 Updated Apr 16, 2026

ZSYNOTZSH / FactualBench

The official repository for the dataset FactualBench, which is introduced in paper "Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization".

Python 3 Updated Dec 30, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,160 401 Updated Apr 23, 2026

OpenBMB / MiniCPM

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,851 571 Updated Feb 11, 2026

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,448 130 Updated Nov 9, 2025

HBX-hbx / Dynamics-of-Zero-Shot-Generalization

Python 1 Updated Oct 18, 2024

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,852 112 Updated Mar 18, 2025

qiancheng0 / EscapeBench

This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box

Python 18 1 Updated Dec 19, 2024

OpenBMB / UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Python 367 16 Updated Dec 29, 2023

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,727 1,123 Updated Apr 30, 2026

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,906 368 Updated Dec 17, 2025

thunlp / Dynamics-of-Zero-Shot-Generalization

Code for the paper "The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning"

Python 5 Updated Apr 8, 2025

Open-Source-O1 / Open-O1

Python 1,346 54 Updated Nov 21, 2024

LC044 / WeChatMsg

41,309 5,073 Updated Dec 30, 2025

OpenBMB / Tell_Me_More

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 62 7 Updated Feb 20, 2024

WooooDyy / LLM-Agent-Paper-List

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

8,112 492 Updated Sep 12, 2025

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 5,287 1,146 Updated Apr 29, 2026

Weixin-Liang / LLM-scientific-feedback

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 531 52 Updated Jan 11, 2024

liexusong / linux-source-code-analyze

Linux kernel source code analysis

1,636 358 Updated Sep 5, 2023

GoogleChrome / chrome-extensions-samples

Chrome Extensions Samples

JavaScript 17,502 9,012 Updated Apr 17, 2026

thunlp / OpenBackdoor

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

Python 206 27 Updated Apr 10, 2023

dibingfa / flash-linux0.11-talk

You call this thing operating system source code? Reading the core code of Linux 0.11 like a novel

HTML 22,305 2,936 Updated Mar 22, 2025

Btlmd / IAI_Gen

What if you need more exercises?

Python 33 5 Updated Jul 16, 2024

sunface / rust-course

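What? You would trust AI to watch your back? I bet you would not, so come learn the safest language of the AI era (Python cannot beat it!). This book offers comprehensive, in-depth explanations, vivid and apt examples, and content as silky smooth as Dove chocolate; it may be the most carefully crafted Chinese-language Rust tutorial available / Book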

Rust 30,314 2,571 Updated Apr 27, 2026

Anduin2017 / HowToCook

Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 99,718 10,932 Updated Apr 12, 2026