Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
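"Dive into Deep Learning" (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
🚀🚀 "Large Models": train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏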
The official Python library for the OpenAI API
A generative world for general-purpose robotics & embodied AI learning.
Fully open reproduction of DeepSeek-R1
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
PyTorch implementations of Generative Adversarial Networks.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Commonly used path planning algorithms with animations.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
OpenMMLab's next-generation platform for general 3D object detection.
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Count the MACs / FLOPs of your PyTorch model.
Denoising Diffusion Probabilistic Models
SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.
openvla / openvla (forked from TRI-ML/prismatic-vlms)
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Release for Improved Denoising Diffusion Probabilistic Models
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Visual tracking library based on PyTorch.
Robotics Toolbox for Python