Starred repositories
Open-Sora: Democratizing Efficient Video Production for All
A generative world for general-purpose robotics & embodied AI learning.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Minimal reproduction of DeepSeek R1-Zero
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Infinite Photorealistic Worlds using Procedural Generation
StarGAN v2 - Official PyTorch Implementation (CVPR 2020)
The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on linear attention
Contrastive unpaired image-to-image translation, faster and lighter training than CycleGAN (ECCV 2020, in PyTorch)
Schedule-Free Optimization in PyTorch
Official Repo for Open-Reasoner-Zero
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) …
Unifying 3D Mesh Generation with Language Models
Pretraining and inference code for a large-scale depth-recurrent language model
[ICML2024] Official code for GaussianPro: 3D Gaussian Splatting with Progressive Propagation
LLM/VLM gaming agents and model evaluation through games.
A PyTorch implementation of "Neural Speech Synthesis with Transformer Network"
IEEE CoG & NeurIPS workshop paper 'Counter-Strike Deathmatch with Large-Scale Behavioural Cloning'
Official Implementation of ECCV2024 paper: Chat Edit 3D: Interactive 3D Scene Editing via Large Language Model
AI-powered dialogue generation for Animal Crossing villagers using LLMs
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloading the trained model checkpoints, and example notebooks / gra…
Python package to create manipulation scenes.
Baba Is You simulator using C++ with some reinforcement learning