Lists (9)
Sort Name ascending (A-Z)
Stars
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
verl: Volcano Engine Reinforcement Learning for LLMs
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
A MNIST-like fashion product database. Benchmark 👇
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
An elegant PyTorch deep reinforcement learning library.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Solve Visual Understanding with Reinforced VLMs
The perfect emulation setup to study and develop the Linux kernel, kernel modules, QEMU, gem5 and x86_64, ARMv7 and ARMv8 userland and baremetal assembly, ANSI C, C++ and POSIX. GDB step debug and …
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
A fast and simple implementation of learning algorithms for robotics.
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Official Repo for Open-Reasoner-Zero
An Open-source RL System from ByteDance Seed and Tsinghua AIR
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Stanford NLP Python library for Representation Finetuning (ReFT)
[CVPR2024 Highlight] VBench - We Evaluate Video Generation