Skip to content
View tsaoyu's full-sized avatar
🚀
Working
🚀
Working

Organizations

@Maritime-Robotics-Student-Society @MaritimeRenewable @WRSC

Block or report tsaoyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,894 71 Updated Jun 22, 2025

Real-Time Detection of Hallucinated Entities in Long-Form Generation

Python 289 30 Updated Nov 16, 2025

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 304 23 Updated Nov 7, 2025

Muon fsdp 2

Python 62 7 Updated Aug 8, 2025

slime is an LLM post-training framework for RL Scaling.

Python 6,656 960 Updated Jun 21, 2026

The evaluation framework for training-free sparse attention in LLMs

Python 123 12 Updated Jan 27, 2026

Customized Inference Engine for Multiverse Models

Python 25 2 Updated Jun 27, 2025

Storing long contexts in tiny caches with self-study

Python 276 38 Updated Mar 23, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,254 290 Updated Jun 22, 2026

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C 1,391 189 Updated Jun 15, 2026

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

Python 5,107 535 Updated Jun 20, 2026

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Python 204 20 Updated Jun 10, 2026

Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.

Python 41 7 Updated May 6, 2026

Agent S: an open agentic framework that uses computers like a human

Python 11,902 1,402 Updated May 13, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 16,533 1,565 Updated May 26, 2026

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 4,010 210 Updated Jun 22, 2026

🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web without worrying about infrastructure.

TypeScript 7,203 938 Updated Jun 9, 2026

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 1,006 53 Updated Feb 5, 2026

Animating R1's thoughts.

Python 380 11 Updated Feb 17, 2025

LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management

Python 77 5 Updated Jan 15, 2025
Python 1,014 176 Updated Apr 24, 2026

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,256 177 Updated Mar 17, 2026

GLM-4-Voice | 端到端中英语音对话模型

Python 3,194 281 Updated Dec 5, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,562 310 Updated Nov 5, 2024

Industry leading face manipulation platform

Python 29,021 4,719 Updated Jun 22, 2026

Rewarded soups official implementation

HTML 64 9 Updated Sep 27, 2023

A lightweight coding agent for open models like Deepseek, Kimi, and Qwen

Rust 64,085 5,556 Updated Jun 20, 2026

browse the modern web on vintage computers

Python 220 19 Updated Mar 18, 2026

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 21,657 2,311 Updated Apr 15, 2026
Next