YX-S-Z

SimonZhai YX-S-Z

This guy is too lazy to describe himself

30 followers · 13 following

https://yx-s-z.github.io/

Achievements

x3 x2

Stars

DexHoldem / DexHoldemSKills

/skills to route dexterous hand policies

Python 6 Updated May 14, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,556 6,677 Updated Jun 23, 2026

louis-e / arnis

Generate any location from the real world in Minecraft with a high level of detail.

Rust 16,203 1,352 Updated Jun 16, 2026

ShumaoZ / poker

Poker simulator and solver

Python 1 Updated Feb 20, 2026

YX-S-Z / texas-holdem-arena

Python 10 1 Updated Feb 25, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 380,090 79,591 Updated Jun 23, 2026

OpenManus / OpenManus-RL

A live-streamed development of RL tuning for LLM agents

Python 4,105 578 Updated May 5, 2026

LeslieTrue / SFTvsRL

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 329 19 Updated Apr 28, 2025

RL4VLM / RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 412 39 Updated Dec 15, 2024

FengdiC / OTTD

Python 1 1 Updated Sep 2, 2024

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

Python 3,434 320 Updated Nov 13, 2024

brentyi / egoallo

Estimating Body and Hand Motion in an Ego-sensed World

Python 282 28 Updated Sep 3, 2025

rail-berkeley / fmb

Python 86 4 Updated May 28, 2024

moka-manipulation / moka

MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)

Python 101 14 Updated Jul 16, 2024

datamllab / rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Python 3,505 749 Updated Jun 26, 2024

google-deepmind / alphageometry

Python 4,866 577 Updated Jan 13, 2026

huggingface / trl

Train transformer language models with reinforcement learning.

Python 18,692 2,800 Updated Jun 23, 2026

apple / axlearn

An Extensible Deep Learning Library

Python 2,367 406 Updated May 16, 2026

efrick2002 / Starling

Jupyter Notebook 8 Updated Sep 7, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 60,047 10,344 Updated Nov 12, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,675 973 Updated Jun 17, 2026

xlang-ai / OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,957 484 Updated Jun 10, 2026

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 2,005 138 Updated Nov 7, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

37,369 2,053 Updated Aug 1, 2024

young-geng / scalax

A simple library for scaling up JAX programs

Python 148 11 Updated Nov 4, 2025

young-geng / mintext

Minimal but scalable implementation of large language models in JAX

Python 34 5 Updated Nov 28, 2025

Genesis-Embodied-AI / RoboGen

A generative and self-guided robotic agent that endlessly proposes and masters new skills.

Python 1,209 109 Updated May 31, 2024

xai-org / grok-1

Grok open release

Python 51,691 8,472 Updated Aug 30, 2024

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 12,079 1,364 Updated Jun 22, 2026

Ma-Lab-Berkeley / CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Python 1,274 98 Updated Oct 23, 2024
