Skip to content
View HBX-hbx's full-sized avatar

Highlights

  • Pro

Block or report HBX-hbx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 240 9 Updated Feb 5, 2026

The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents"

Python 27 Updated Dec 10, 2025

Awesome Open-ended AI

399 40 Updated Feb 12, 2026

The official repository for the dataset FactualBench, which is introduced in paper "Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization".

Python 3 Updated Dec 30, 2025

My learning notes for ML SYS.

Python 5,356 348 Updated Jan 30, 2026

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,637 542 Updated Feb 11, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,331 129 Updated Nov 9, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,805 103 Updated Mar 18, 2025

This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box

Python 18 1 Updated Dec 19, 2024

A large-scale, fine-grained, diverse preference dataset (and models).

Python 363 15 Updated Dec 29, 2023

A bibliography and survey of the papers surrounding o1

TeX 1,211 51 Updated Nov 16, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,346 1,109 Updated Feb 7, 2026

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,885 368 Updated Dec 17, 2025

Code for the paper "The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning"

Python 5 Updated Apr 8, 2025
Python 1,344 53 Updated Nov 21, 2024

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 61 7 Updated Feb 20, 2024

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

8,062 491 Updated Sep 12, 2025

LaTeX Thesis Template for Tsinghua University

TeX 5,152 1,138 Updated Jan 4, 2026

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 530 51 Updated Jan 11, 2024

Linux内核源码分析

1,629 353 Updated Sep 5, 2023

Chrome Extensions Samples

JavaScript 17,353 8,962 Updated Jan 26, 2026

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

Python 200 27 Updated Apr 10, 2023

你管这破玩意叫操作系统源码 — 像小说一样品读 Linux 0.11 核心代码

HTML 22,048 2,914 Updated Mar 22, 2025

What if you need more exercises?

Python 31 4 Updated Jul 16, 2024

“连续八年成为全世界最受喜爱的语言,无 GC 也无需手动内存管理、极高的性能和安全性、过程/OO/函数式编程、优秀的包管理、JS 未来基石" — 工作之余的第二语言来试试 Rust 吧。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容,这可能是目前最用心的 Rust 中文学习教程 / Book

Rust 29,949 2,545 Updated Jan 29, 2026

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 97,983 10,822 Updated Feb 12, 2026

计算机自学指南

HTML 71,324 7,835 Updated Feb 2, 2026

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 1 Updated Jan 1, 2021
Next