Stars
Official implementation of ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design.
Claw-Eval is an evaluation harness for LLMs as agents. All tasks are verified by humans.
A benchmark for LLMs on complex tasks in the terminal
A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
🦞 Just talk to your agent — it learns and EVOLVES 🧬.
GRPO training code that scales to 32×H100s for long-horizon terminal/coding tasks. The base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.
Reinforcement Learning via Self-Distillation (SDPO)
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
OpenClaw-RL: Train any agent simply by talking
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
slime is an LLM post-training framework for RL Scaling.
DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation
Pioneering Automated GUI Interaction with Native Agents
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
Wan: Open and Advanced Large-Scale Video Generative Models
A beautiful, simple, clean, and responsive Jekyll theme for academics
[ICML 2026 Spotlight] Latent Collaboration in Multi-Agent Systems
[ICLR 2026 Oral] Generative Universal Verifier as Multimodal Meta-Reasoner
Cambrian-S: Towards Spatial Supersensing in Video
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework