Organizations

@HIT-SCIR @sail-sg @sea-sailor @terminal-agent

AI demo for playing ARPG/Soul-like game with RL frame

Python 398 71 Updated Sep 24, 2024

AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.

Python 954 122 Updated Nov 18, 2023

The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"

Python 34 Updated Mar 26, 2026

💻 Terminal-Agent with Human-in-the-Loop Learning

Python 39 2 Updated Jan 16, 2026

Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training

Python 67 13 Updated Jul 28, 2025

The official repository for "Rongsheng Wang's Arxiv Template"

TeX 61 8 Updated May 7, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,993 351 Updated Jun 13, 2026

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

280 12 Updated Jun 5, 2026

Defeating the Training-Inference Mismatch via FP16

Python 195 17 Updated Nov 14, 2025

[ICML'26] Scaling Long-Horizon LLM Agent via Context-Folding

Python 161 11 Updated May 18, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,109 891 Updated Jun 13, 2026
C 15 Updated Oct 13, 2025

User Profile-Based Long-Term Memory for AI Chatbot Applications.

Python 2,753 219 Updated Jan 11, 2026

A tool for exploring each layer in a docker image

Go 54,220 1,980 Updated Dec 15, 2025

Docker image registry for SWE-bench, created by Epoch AI.

Python 18 1 Updated Aug 21, 2025

Fast, Flexible and Portable Structured Generation

C++ 1,739 153 Updated Jun 11, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,875 600 Updated Jun 13, 2026

A curated list of papers related to constrained decoding of LLMs, along with their relevant code and resources.

350 16 Updated Jan 22, 2026

☑️ A simple and extensible shell script for managing your todo.txt file.

Shell 6,118 737 Updated Nov 24, 2025

A Tool to Visualize Claude Code's LLM Interactions

JavaScript 2,385 407 Updated Aug 26, 2025

The official GitHub repo for "Diffusion Language Models are Super Data Learners".

Python 228 8 Updated Nov 6, 2025

A Gym for Agentic LLMs

Python 494 33 Updated Jan 21, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 105,239 14,053 Updated Jun 13, 2026

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

TypeScript 167 8 Updated Nov 6, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,261 59 Updated Aug 27, 2025

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 269 18 Updated Jul 8, 2025

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 674 120 Updated Jun 8, 2026

Create knowledge graphs with LLMs

Jupyter Notebook 508 31 Updated Oct 11, 2025

An incremental parsing system for programming tools

Rust 25,815 2,693 Updated Jun 13, 2026

OpenAI Frontier Evals

Python 1,220 162 Updated Apr 21, 2026