Skip to content
View yinjjiew's full-sized avatar

Highlights

  • Pro

Organizations

@Gen-Verse

Block or report yinjjiew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design.

Python 8 1 Updated Apr 21, 2026

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python 569 50 Updated May 17, 2026

A benchmark for LLMs on complicated tasks in the terminal

Python 2,213 514 Updated Jan 22, 2026

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.

Python 590 35 Updated Apr 20, 2026
Python 355 27 Updated Aug 12, 2025

🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Python 3,384 440 Updated Apr 11, 2026

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 383 26 Updated Aug 24, 2025

Reinforcement Learning via Self-Distillation (SDPO)

Python 876 94 Updated Feb 18, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,340 210 Updated May 17, 2026

OpenClaw-RL: Train any agent simply by talking

Python 5,329 580 Updated May 12, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 372,614 77,237 Updated May 17, 2026

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Python 754 75 Updated Apr 16, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,181 496 Updated May 16, 2026

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Python 352 40 Updated May 1, 2026

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,959 862 Updated Apr 1, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,707 796 Updated May 14, 2026

EvoCUA: Evolving Computer Use Agent

Python 317 22 Updated Mar 31, 2026

DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation

Python 92 6 Updated Feb 26, 2026

Pioneering Automated GUI Interaction with Native Agents

Python 10,614 791 Updated Jan 27, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,500 256 Updated Apr 15, 2026

[ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀

Python 127 8 Updated May 1, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,783 1,944 Updated Mar 17, 2026

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,618 12,998 Updated May 15, 2026

[ICML 2026 Spotlight] Latent Collaboration in Multi-Agent Systems

Python 950 142 Updated May 1, 2026

[ICLR 2026 Oral] Generative Universal Verifier as Multimodal Meta-Reasoner

Python 58 6 Updated Nov 14, 2025

dLLM: Simple Diffusion Language Modeling

Python 2,506 263 Updated Apr 15, 2026

Cambrian-S: Towards Spatial Supersensing in Video

Python 544 19 Updated Apr 3, 2026

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 286 22 Updated Jan 17, 2026
Next