Skip to content
View tsaoyu's full-sized avatar
🚀
Working
🚀
Working

Highlights

  • Pro

Organizations

@Maritime-Robotics-Student-Society @MaritimeRenewable @WRSC

Block or report tsaoyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,865 70 Updated Jun 22, 2025

Real-Time Detection of Hallucinated Entities in Long-Form Generation

Python 278 27 Updated Nov 16, 2025

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 292 20 Updated Nov 7, 2025

Muon fsdp 2

Python 53 6 Updated Aug 8, 2025

slime is an LLM post-training framework for RL Scaling.

Python 4,201 543 Updated Feb 14, 2026

The evaluation framework for training-free sparse attention in LLMs

Python 119 11 Updated Jan 27, 2026

Customized Inference Engine for Multiverse Models

Python 24 2 Updated Jun 27, 2025

Storing long contexts in tiny caches with self-study

Python 239 30 Updated Dec 5, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,833 216 Updated Feb 16, 2026

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,343 182 Updated Dec 17, 2025

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

Python 4,804 498 Updated Feb 16, 2026

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Python 193 17 Updated Sep 23, 2025

Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.

Python 40 5 Updated Nov 19, 2025

Agent S: an open agentic framework that uses computers like a human

Python 9,758 1,126 Updated Jan 19, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 14,674 1,373 Updated Jan 31, 2026

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,596 187 Updated Feb 12, 2026

🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web without worrying about infrastructure.

TypeScript 6,427 917 Updated Feb 11, 2026

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 965 50 Updated Feb 5, 2026

Animating R1's thoughts.

Python 384 11 Updated Feb 17, 2025

LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management

Python 75 4 Updated Jan 15, 2025
C++ 810 137 Updated Dec 31, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,128 148 Updated Feb 10, 2026

GLM-4-Voice | 端到端中英语音对话模型

Python 3,140 273 Updated Dec 5, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,523 306 Updated Nov 5, 2024

Industry leading face manipulation platform

Python 26,801 4,295 Updated Feb 16, 2026

Rewarded soups official implementation

HTML 62 9 Updated Sep 27, 2023

A natural language interface for computers

Python 62,156 5,341 Updated Feb 9, 2026

browse the modern web on vintage computers

Python 206 16 Updated Dec 4, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,966 2,231 Updated Mar 11, 2025
Next