Skip to content
View wangtong10086's full-sized avatar

Block or report wangtong10086

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

qqr is an RL training framework for open-ended agents.

Python 205 19 Updated Jan 21, 2026

Kubernetes compatible infrastructure for Affine

Python 10 23 Updated Feb 3, 2026

Internet-scale Neural Networks

Python 1,359 438 Updated Jan 31, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 829 96 Updated Feb 4, 2026

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 622 61 Updated Jan 28, 2026

Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication

Python 630 49 Updated Dec 2, 2025

Official repo for UAE

Python 162 5 Updated Dec 29, 2025

A framework for efficient model inference with omni-modality models

Python 2,591 374 Updated Feb 4, 2026

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,029 980 Updated Jul 8, 2025

An extremely fast Python type checker and language server, written in Rust.

Python 16,987 206 Updated Feb 3, 2026

The repo for In-context Autoencoder

Jupyter Notebook 165 20 Updated May 11, 2024

official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Python 195 11 Updated May 31, 2024

Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression

Python 284 5 Updated Jan 27, 2026

Official PyTorch Implementation of "Flow Map Distillation Without Data"

Python 115 10 Updated Nov 25, 2025

[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Python 131 12 Updated Dec 5, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,623 6,605 Updated Jan 22, 2026

Data mapping framework for rust stuff

Rust 44 4 Updated Feb 3, 2026

Tooling for exact and MinHash deduplication of large-scale text datasets

Rust 65 5 Updated Jan 15, 2026

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,514 969 Updated Feb 3, 2026

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,068 136 Updated Dec 8, 2025

Interactive visualizations of the geometric intuition behind diffusion models.

JavaScript 1,060 51 Updated Jan 31, 2026

🌚 🌍 🌝 GeoIP 规则文件加强版,支持自行定制 V2Ray dat 格式文件 geoip.dat、MaxMind mmdb 格式文件、sing-box SRS 格式文件、mihomo MRS 格式文件、Clash ruleset、Surge ruleset 等。Enhanced edition of GeoIP files for V2Ray, Xray-core, sing-box,…

Go 5,582 840 Updated Feb 2, 2026

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 26,348 3,365 Updated Feb 4, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,365 253 Updated Feb 4, 2026

Minimal reproduction of OneRec

Python 982 141 Updated Feb 1, 2026

Native Multimodal Models are World Learners

Python 1,445 54 Updated Dec 30, 2025

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 445 37 Updated Jan 26, 2026

High-throughput tensor loading for PyTorch

Python 221 14 Updated Jan 22, 2026

The best ChatGPT that $100 can buy.

Python 42,062 5,432 Updated Feb 4, 2026
Next