Starred repositories
This repo contains the Hugging Face Deep Reinforcement Learning Course.
A 3D Gaussian Splatting framework with various derived algorithms and an interactive web viewer
Interactive Pytorch forward pass visualization in notebooks
Scalable Multi-Agent RL Training School for Autonomous Driving
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
A guide to help developers get up and running quickly with the OpenCL programming framework
Reference PyTorch implementation and models for DINOv3
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
3D Edge Mapping using Edge-Specialized Gaussian Splatting
Pytorch implementation of our paper "CLRNet: Cross Layer Refinement Network for Lane Detection" (CVPR2022 Acceptance).
The simplest, fastest repository for training/finetuning small-sized VLMs.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Implementing DeepSeek R1's GRPO algorithm from scratch
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A toy implementation of a diffusion model for low-dimensional data
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
(ICCV 2025) GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting
✨✨Latest Advances on Multimodal Large Language Models
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
[ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Benchmark and model for step-by-step reasoning in autonomous driving.
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks