eric-haibin-lin

🎯

Stealth mode…

haibin eric-haibin-lin

🎯

Stealth mode…

LLM systems.

984 followers · 182 following

Bytedance Seed
https://sites.google.com/view/haibinlin/
@eric_haibin_lin

Achievements

x4 x3

Achievements

x4 x3

Organizations

Stars

vllm-project / tpu-inference

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 351 213 Updated Jun 15, 2026

sgl-project / sglang-jax

JAX backend for SGL

Python 280 105 Updated Jun 15, 2026

stepfun-ai / StepMesh

C++ 367 41 Updated Jan 28, 2026

qlabs-eng / slowrun

100M tokens. Infinite compute. Lowest val loss wins.

Python 493 74 Updated Jun 15, 2026

rlops / rlix

Run more RL experiments. Wait less for GPUs.

Python 287 17 Updated May 24, 2026

deepseek-ai / EPLB

Expert Parallelism Load Balancer

Python 1,388 203 Updated Mar 24, 2025

huggingface / OpenEnv

An interface library for RL post training with environments.

Python 2,243 396 Updated Jun 13, 2026

anomalyco / opencode

The open source coding agent.

TypeScript 174,662 21,136 Updated Jun 15, 2026

verl-project / verl-recipe

A set of examples based on verl for end-to-end RL training recipes.

Python 291 134 Updated Jun 9, 2026

apple / axlearn

An Extensible Deep Learning Library

Python 2,366 405 Updated May 16, 2026

Mini-o3 / Mini-o3

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 422 18 Updated Jan 29, 2026

InternLM / InternBootcamp

Official implement on InternBootCamp

Python 349 27 Updated Jun 10, 2026

ctlllll / gpt-oss-reverse-engineering

Jupyter Notebook 71 2 Updated Aug 6, 2025

TsinghuaC3I / SSRL

SSRL: Self-Search Reinforcement Learning

Python 208 13 Updated Aug 20, 2025

google / tunix

A Lightweight LLM Post-Training Library

Python 2,343 309 Updated Jun 15, 2026

MiroMindAI / MiroRL

MiroRL is an MCP-first reinforcement learning framework for deep research agent.

Python 246 24 Updated Aug 27, 2025

tmlr-group / Co-rewarding

Forked from resistzzz/Co-rewarding

[ICLR 2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"

Python 57 1 Updated Feb 4, 2026

PRIME-RL / SimpleVLA-RL

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,730 113 Updated Jan 6, 2026

ByteDance-Seed / Seed-Prover

Lean 434 28 Updated Feb 13, 2026

axon-rl / gem

A Gym for Agentic LLMs

Python 494 33 Updated Jan 21, 2026

Tencent / digitalhuman

Python 348 49 Updated Jan 29, 2026

LLM360 / Reasoning360

A repo for open research on building large reasoning models

Python 148 19 Updated Mar 3, 2026

kxfan2002 / SophiaVL-R1

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Python 95 3 Updated Aug 8, 2025

ChengpengLi1003 / CoRT

Python 72 5 Updated Oct 23, 2025

real-absolute-AI / NoisyRollout

[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Python 110 3 Updated Sep 18, 2025

RM-R1-UIUC / RM-R1

[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 165 17 Updated Jun 26, 2025

InfiXAI / InfiGUI-R1

Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"

Python 65 5 Updated Dec 4, 2025

UCSB-NLP-Chang / ThinkPrune

Python 46 2 Updated Sep 27, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 17,310 1,515 Updated Apr 29, 2026

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 2,014 212 Updated Jun 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

haibin eric-haibin-lin

Achievements

Achievements

Organizations

Block or report eric-haibin-lin

Stars

vllm-project / tpu-inference

sgl-project / sglang-jax

stepfun-ai / StepMesh

qlabs-eng / slowrun

rlops / rlix

deepseek-ai / EPLB

huggingface / OpenEnv

anomalyco / opencode

verl-project / verl-recipe

apple / axlearn

Mini-o3 / Mini-o3

InternLM / InternBootcamp

ctlllll / gpt-oss-reverse-engineering

TsinghuaC3I / SSRL

google / tunix

MiroMindAI / MiroRL

tmlr-group / Co-rewarding

PRIME-RL / SimpleVLA-RL

ByteDance-Seed / Seed-Prover

axon-rl / gem

Tencent / digitalhuman

LLM360 / Reasoning360

kxfan2002 / SophiaVL-R1

ChengpengLi1003 / CoRT

real-absolute-AI / NoisyRollout

RM-R1-UIUC / RM-R1

InfiXAI / InfiGUI-R1

UCSB-NLP-Chang / ThinkPrune

microsoft / agent-lightning

ByteDance-Seed / VeOmni