Starred repositories
A generative world for general-purpose robotics & embodied AI learning.
A collection of diffusion model papers categorized by their subareas
Infinite Photorealistic Worlds using Procedural Generation
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Pretraining and inference code for a large-scale depth-recurrent language model
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
AI-powered dialogue generation for Animal Crossing villagers using LLMs
LLM/VLM gaming agents and model evaluation through games.
Source code for the X Recommendation Algorithm
[ICML2024] Official code for GaussianPro: 3D Gaussian Splatting with Progressive Propagation
[CVPR 2025] Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on Linear Attention
Python package to create manipulation scenes.
Official Implementation of ECCV2024 paper: Chat Edit 3D: Interactive 3D Scene Editing via Large Language Model
Official Repo for Open-Reasoner-Zero
Schedule-Free Optimization in PyTorch
Original implementation of "Radiant Foam: Real-Time Differentiable Ray Tracing"
Open-Sora: Democratizing Efficient Video Production for All
Minimal reproduction of DeepSeek R1-Zero
Unifying 3D Mesh Generation with Language Models
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) …