
Official Repository for our paper: AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

24 Updated Jun 5, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,933 109,957 Updated Jun 8, 2026

Agentic Coding for Builders who Ship

Rust 9,822 7,782 Updated Jun 11, 2026

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,505 306 Updated Apr 10, 2026
Python 34 3 Updated Aug 26, 2025

[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 165 17 Updated Jun 26, 2025

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 533 53 Updated Jan 11, 2024

Our library for RL environments + evals

Python 4,200 561 Updated Jun 17, 2026

My learning notes for ML SYS.

Python 6,532 443 Updated Jun 8, 2026

[ICLR'24 spotlight] Tool-Augmented Reward Modeling

Python 55 1 Updated Jun 6, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,827 83 Updated May 11, 2025
Jupyter Notebook 4 Updated May 4, 2025

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 82 3 Updated Jul 18, 2025

s1: Simple test-time scaling

Python 6,656 756 Updated Jun 25, 2025

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,011 4,086 Updated Jun 16, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,623 577 Updated Jun 17, 2026

Replicating the Illinois letterhead in LaTeX

TeX 50 19 Updated Apr 4, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,651 969 Updated Jun 9, 2026

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,846 131 Updated Jan 17, 2025

🙌 OpenHands: AI-Driven Development

Python 77,431 9,840 Updated Jun 17, 2026

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 376 21 Updated Jun 11, 2024

MATCH: Metadata-Aware Text Classification in A Large Hierarchy (WWW'21)

Python 117 18 Updated Apr 2, 2024

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)

Python 59 2 Updated Apr 2, 2024

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains (AAAI'24)

Python 8 1 Updated Apr 2, 2024

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)

Python 11 Updated Aug 24, 2024

The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study (WWW'23)

C++ 67 3 Updated May 27, 2023

Minimally Supervised Categorization of Text with Metadata (SIGIR'20)

Python 47 3 Updated Apr 2, 2024

Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)

Python 32 5 Updated Jun 21, 2025