Skip to content
View YX-S-Z's full-sized avatar

Block or report YX-S-Z

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

/skills to route dexterous hand policies

Python 6 Updated May 14, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,556 6,677 Updated Jun 23, 2026

Generate any location from the real world in Minecraft with a high level of detail.

Rust 16,203 1,352 Updated Jun 16, 2026

Poker simulator and solver

Python 1 Updated Feb 20, 2026
Python 10 1 Updated Feb 25, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 380,090 79,591 Updated Jun 23, 2026

A live stream development of RL tunning for LLM agents

Python 4,105 578 Updated May 5, 2026

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 329 19 Updated Apr 28, 2025

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 412 39 Updated Dec 15, 2024
Python 1 1 Updated Sep 2, 2024

Entropy Based Sampling and Parallel CoT Decoding

Python 3,434 320 Updated Nov 13, 2024

Estimating Body and Hand Motion in an Ego-sensed World

Python 282 28 Updated Sep 3, 2025
Python 86 4 Updated May 28, 2024

MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)

Python 101 14 Updated Jul 16, 2024

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Python 3,505 749 Updated Jun 26, 2024

Train transformer language models with reinforcement learning.

Python 18,692 2,800 Updated Jun 23, 2026

An Extensible Deep Learning Library

Python 2,367 406 Updated May 16, 2026
Jupyter Notebook 8 Updated Sep 7, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 60,047 10,344 Updated Nov 12, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,675 973 Updated Jun 17, 2026

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,957 484 Updated Jun 10, 2026

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 2,005 138 Updated Nov 7, 2025

LLM101n: Let's build a Storyteller

37,369 2,053 Updated Aug 1, 2024

A simple library for scaling up JAX programs

Python 148 11 Updated Nov 4, 2025

Minimal but scalable implementation of large language models in JAX

Python 34 5 Updated Nov 28, 2025

A generative and self-guided robotic agent that endlessly propose and master new skills.

Python 1,209 109 Updated May 31, 2024

Grok open release

Python 51,691 8,472 Updated Aug 30, 2024

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 12,079 1,364 Updated Jun 22, 2026

Code for CRATE (Coding RAte reduction TransformEr).

Python 1,274 98 Updated Oct 23, 2024
Next