tsaoyu

Follow

🚀

Working

Tony Yu Cao tsaoyu

🚀

Working

Follow

LLM, Reinforcement Learning, Robotics

95 followers · 39 following

https://www.tsaoyu.com

Achievements

Achievements

Highlights

Pro

Organizations

Lists (1)

Sort

Distributed-computing

The future of distributed computing

Starred repositories

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,865 70 Updated Jun 22, 2025

obalcells / hallucination_probes

Real-Time Detection of Hallucinated Entities in Long-Form Generation

Python 278 27 Updated Nov 16, 2025

yaof20 / Flash-RL

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 292 20 Updated Nov 7, 2025

samsja / muon_fsdp_2

Muon fsdp 2

Python 53 6 Updated Aug 8, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 4,201 543 Updated Feb 14, 2026

PiotrNawrot / sparse-frontier

The evaluation framework for training-free sparse attention in LLMs

Python 119 11 Updated Jan 27, 2026

Multiverse4FM / Multiverse-Engine

Customized Inference Engine for Multiverse Models

Python 24 2 Updated Jun 27, 2025

HazyResearch / cartridges

Storing long contexts in tiny caches with self-study

Python 239 30 Updated Dec 5, 2025

ScalingIntelligence / tokasaurus

Python 467 35 Updated Nov 25, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,833 216 Updated Feb 16, 2026

NVIDIA / gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,343 182 Updated Dec 17, 2025

transformerlab / transformerlab-app

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

Python 4,804 498 Updated Feb 16, 2026

microsoft / SeerAttention

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Python 193 17 Updated Sep 23, 2025

rdnfn / icai

Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.

Python 40 5 Updated Nov 19, 2025

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 9,758 1,126 Updated Jan 19, 2026

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 14,674 1,373 Updated Jan 31, 2026

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,596 187 Updated Feb 12, 2026

steel-dev / steel-browser

🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web without worrying about infrastructure.

TypeScript 6,427 917 Updated Feb 11, 2026

fla-org / native-sparse-attention

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 965 50 Updated Feb 5, 2026

dhealy05 / frames_of_mind

Animating R1's thoughts.

Python 384 11 Updated Feb 17, 2025

facebookresearch / LeanUniverse

LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management

Python 75 4 Updated Jan 15, 2025

aliyun / SimAI

C++ 810 137 Updated Dec 31, 2025

ServiceNow / BrowserGym

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,128 148 Updated Feb 10, 2026

zai-org / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 3,140 273 Updated Dec 5, 2024

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,523 306 Updated Nov 5, 2024

facefusion / facefusion

Industry leading face manipulation platform

Python 26,801 4,295 Updated Feb 16, 2026

alexrame / rewardedsoups

Rewarded soups official implementation

HTML 62 9 Updated Sep 27, 2023

openinterpreter / open-interpreter

A natural language interface for computers

Python 62,156 5,341 Updated Feb 9, 2026

hunterirving / macproxy_plus

browse the modern web on vintage computers

Python 206 16 Updated Dec 4, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,966 2,231 Updated Mar 11, 2025

Starred topics

gazebo

Robotics

sailing