HBX-hbx

Follow

He Bingxiang HBX-hbx

Follow

Second Year PhD Candidate of Tsinghua University @thunlp

37 followers · 16 following

Tsinghua University
Beijing, China
17:49 (UTC +08:00)
https://hbx-hbx.github.io/
@hbx_hbx

Achievements

Achievements

Highlights

Pro

Stars

PRIME-RL / TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 1,037 79 Updated Mar 11, 2026

thunlp / JustRL

[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Python 257 12 Updated Mar 11, 2026

JiayuJeff / CostBench

The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents"

Python 30 Updated Mar 30, 2026

jennyzzt / awesome-open-ended

Awesome Open-ended AI

417 44 Updated Feb 26, 2026

ZSYNOTZSH / FactualBench

The official repository for the dataset FactualBench, which is introduced in paper "Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization".

Python 3 Updated Dec 30, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 5,870 382 Updated Apr 3, 2026

OpenBMB / MiniCPM

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,774 556 Updated Feb 11, 2026

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,413 128 Updated Nov 9, 2025

HBX-hbx / Dynamics-of-Zero-Shot-Generalization

Python 1 Updated Oct 18, 2024

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,838 108 Updated Mar 18, 2025

qiancheng0 / EscapeBench

This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box

Python 18 1 Updated Dec 19, 2024

OpenBMB / UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Python 366 15 Updated Dec 29, 2023

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,214 51 Updated Nov 16, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,563 1,120 Updated Apr 3, 2026

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,904 369 Updated Dec 17, 2025

thunlp / Dynamics-of-Zero-Shot-Generalization

Code for the paper "The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning"

Python 5 Updated Apr 8, 2025

Open-Source-O1 / Open-O1

Python 1,346 53 Updated Nov 21, 2024

LC044 / WeChatMsg

40,849 4,962 Updated Dec 30, 2025

OpenBMB / Tell_Me_More

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 62 7 Updated Feb 20, 2024

WooooDyy / LLM-Agent-Paper-List

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

8,098 491 Updated Sep 12, 2025

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 5,243 1,144 Updated Apr 4, 2026

Weixin-Liang / LLM-scientific-feedback

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 532 52 Updated Jan 11, 2024

liexusong / linux-source-code-analyze

Linux内核源码分析

1,637 356 Updated Sep 5, 2023

GoogleChrome / chrome-extensions-samples

Chrome Extensions Samples

JavaScript 17,441 9,001 Updated Mar 27, 2026

thunlp / OpenBackdoor

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

Python 202 27 Updated Apr 10, 2023

dibingfa / flash-linux0.11-talk

你管这破玩意叫操作系统源码 — 像小说一样品读 Linux 0.11 核心代码

HTML 22,220 2,930 Updated Mar 22, 2025

Btlmd / IAI_Gen

What if you need more exercises?

Python 33 5 Updated Jul 16, 2024

sunface / rust-course

什么？你敢放心的把后背交给 AI? 我赌你不敢，那就来学学 AI 时代最安全的语言吧(Python无法战胜！)。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容，这可能是目前最用心的 Rust 中文学习教程 / Book

Rust 30,201 2,566 Updated Mar 12, 2026

Anduin2017 / HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 99,164 10,891 Updated Mar 21, 2026

PKUFlyingPig / cs-self-learning

计算机自学指南

HTML 72,142 7,856 Updated Feb 24, 2026