Stars
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A toolkit for developing and comparing reinforcement learning algorithms.
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Fully open reproduction of DeepSeek-R1
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🛠「Watt Toolkit」是一个开源跨平台的多功能 Steam 工具箱。
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Train transformer language models with reinforcement learning.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
PyTorch code and models for the DINOv2 self-supervised learning method.
Code release for NeRF (Neural Radiance Fields)
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm,Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP(Traveling sa…
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
PyTorch implementations of deep reinforcement learning algorithms and environments
Solve Visual Understanding with Reinforced VLMs
CoTracker is a model for tracking any point (pixel) on a video.
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN