High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,029 980 Updated Jul 8, 2025

astral-sh / ty

An extremely fast Python type checker and language server, written in Rust.

Python 16,987 206 Updated Feb 3, 2026

getao / icae

The repo for In-context Autoencoder

Jupyter Notebook 165 20 Updated May 11, 2024

ucaslcl / Fox

Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Python 195 11 Updated May 31, 2024

liufanfanlff / C3-Context-Cascade-Compression

Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression

Python 284 5 Updated Jan 27, 2026

ShangyuanTong / FreeFlow

Official PyTorch Implementation of "Flow Map Distillation Without Data"

Python 115 10 Updated Nov 25, 2025

mit-han-lab / fastrl

[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Python 131 12 Updated Dec 5, 2025

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,623 6,605 Updated Jan 22, 2026

allenai / datamap-rs

Data mapping framework for Rust stuff

Rust 44 4 Updated Feb 3, 2026

allenai / duplodocus

Tooling for exact and MinHash deduplication of large-scale text datasets

Rust 65 5 Updated Jan 15, 2026

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,514 969 Updated Feb 3, 2026

LTH14 / JiT

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,068 136 Updated Dec 8, 2025

helblazer811 / Diffusion-Explorer

Interactive visualizations of the geometric intuition behind diffusion models.

JavaScript 1,060 51 Updated Jan 31, 2026

Loyalsoldier / geoip

🌚 🌍 🌝 Enhanced edition of GeoIP rule files, with support for building your own V2Ray dat format file geoip.dat, MaxMind mmdb format files, sing-box SRS format files, mihomo MRS format files, Clash ruleset, Surge ruleset, and more. Enhanced edition of GeoIP files for V2Ray, Xray-core, sing-box,…

Go 5,582 840 Updated Feb 2, 2026

HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 26,348 3,365 Updated Feb 4, 2026

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,365 253 Updated Feb 4, 2026

AkaliKong / MiniOneRec

Minimal reproduction of OneRec

Python 982 141 Updated Feb 1, 2026

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,445 54 Updated Dec 30, 2025

tensorgi / TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 445 37 Updated Jan 26, 2026

fal-ai / flashpack

High-throughput tensor loading for PyTorch

Python 221 14 Updated Jan 22, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 42,062 5,432 Updated Feb 4, 2026

魑魅魍魉 wangtong10086

Stars

Alibaba-NLP / qqr

AffineFoundation / affinetes

opentensor / bittensor

radixark / miles

open-tinker / OpenTinker

IQuestLab / IQuest-Coder-V1

google-deepmind / disco_rl

WeichenFan / UAE

vllm-project / vllm-omni

vwxyzjn / cleanrl