Skip to content
View ChangshuoShen's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report ChangshuoShen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,387 4,342 Updated Feb 6, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,505 283 Updated Feb 6, 2026

[ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping".

Python 163 13 Updated May 2, 2025

Improved techniques for optimization-based jailbreaking on large language models (ICLR2025)

Python 142 14 Updated Apr 7, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,391 4,778 Updated Jun 2, 2025

一个开源的GPU服务器管理平台;可以实时查看模型训练状态、GPU资源占用、模型训练日志、IP访问记录等

JavaScript 39 6 Updated Apr 19, 2024

Homepage for An

HTML 1 Updated Dec 9, 2025

Minimal reproduction of OneRec

Python 1,000 142 Updated Feb 1, 2026

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 185 22 Updated May 25, 2025

collecting publicly available distillation datasets based on DepSeek-R1

25 1 Updated Mar 12, 2025

Fast and memory-efficient exact attention

Python 22,117 2,352 Updated Feb 5, 2026

[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Python 64 1 Updated Sep 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 1 1 Updated Oct 31, 2025

A Unified Framework for High-Performance and Extensible LLM Steering

Python 163 14 Updated Feb 5, 2026

[COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free

Python 51 4 Updated Apr 6, 2025

Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]

Python 179 8 Updated Jul 8, 2025

Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"

Python 339 26 Updated Nov 13, 2025

An open protocol enabling communication and interoperability between opaque agentic applications.

Python 1 Updated May 22, 2025

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Shell 145 6 Updated Nov 2, 2024

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 532 21 Updated Jan 4, 2026

[NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning

Python 123 5 Updated Dec 13, 2025

USTC 大数据系统及综合实验大作业

Python 6 Updated Dec 31, 2024

personal web

HTML 1 Updated Sep 8, 2025

Automated tool for running Python programs in a streamlined manner

JavaScript 356 21 Updated Jan 12, 2026

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Python 68 Updated Jul 24, 2025

[ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can be What Recommenders Need: Findings and Potentials"

Python 97 6 Updated May 16, 2025

[SIGIR 2024 perspective] The implementation of paper "On Generative Agents in Recommendation"

Python 456 54 Updated Jul 7, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,027 3,202 Updated Feb 6, 2026
Next