Stars
Official code for the CVPR 2022 (oral) paper "Extracting Triangular 3D Models, Materials, and Lighting From Images".
This SDK is now deprecated, use the new unified Google GenAI SDK.
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
A simple, performant and scalable Jax LLM!
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx
Reinforcement Learning Environments for Omniverse Isaac Gym
Code for NeurIPS 2022 Paper, "Poisson Flow Generative Models" (PFGM)
Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"
Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)
[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation
Implementation of RT1 (Robotic Transformer) in Pytorch
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
A Telegram bot to recommend arXiv papers
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
[ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
[NAACL 2025] Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"
[NeurlPS-2024] The official code of MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
[EMNLP 2024 Main] MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension