Skip to content
View eric-haibin-lin's full-sized avatar
🎯
Stealth mode…
🎯
Stealth mode…

Organizations

@apache @cmu-db @dmlc

Block or report eric-haibin-lin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 351 213 Updated Jun 15, 2026

JAX backend for SGL

Python 280 105 Updated Jun 15, 2026
C++ 367 41 Updated Jan 28, 2026

100M tokens. Infinite compute. Lowest val loss wins.

Python 493 74 Updated Jun 15, 2026

Run more RL experiments. Wait less for GPUs.

Python 287 17 Updated May 24, 2026

Expert Parallelism Load Balancer

Python 1,388 203 Updated Mar 24, 2025

An interface library for RL post training with environments.

Python 2,243 396 Updated Jun 13, 2026

The open source coding agent.

TypeScript 174,662 21,136 Updated Jun 15, 2026

A set of examples based on verl for end-to-end RL training recipes.

Python 291 134 Updated Jun 9, 2026

An Extensible Deep Learning Library

Python 2,366 405 Updated May 16, 2026

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 422 18 Updated Jan 29, 2026

Official implement on InternBootCamp

Python 349 27 Updated Jun 10, 2026
Jupyter Notebook 71 2 Updated Aug 6, 2025

SSRL: Self-Search Reinforcement Learning

Python 208 13 Updated Aug 20, 2025

A Lightweight LLM Post-Training Library

Python 2,343 309 Updated Jun 15, 2026

MiroRL is an MCP-first reinforcement learning framework for deep research agent.

Python 246 24 Updated Aug 27, 2025

[ICLR 2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"

Python 57 1 Updated Feb 4, 2026

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,730 113 Updated Jan 6, 2026

A Gym for Agentic LLMs

Python 494 33 Updated Jan 21, 2026
Python 348 49 Updated Jan 29, 2026

A repo for open research on building large reasoning models

Python 148 19 Updated Mar 3, 2026

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Python 95 3 Updated Aug 8, 2025
Python 72 5 Updated Oct 23, 2025

[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Python 110 3 Updated Sep 18, 2025

[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 165 17 Updated Jun 26, 2025

Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"

Python 65 5 Updated Dec 4, 2025
Python 46 2 Updated Sep 27, 2025

The absolute trainer to light up AI agents.

Python 17,310 1,515 Updated Apr 29, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 2,014 212 Updated Jun 15, 2026
Next