Starred repositories
SynCity: Training-Free Generation of 3D Worlds
TAPIP3D: Tracking Any Point in Persistent 3D Geometry
A unified inference and post-training framework for accelerated video generation.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
The official implementation of the Paper: "StyleSculptor: Zero-Shot Style-Controllable 3D Asset Generation with Texture-Geometry Dual Guidance"
Modeling, training, eval, and inference code for OLMo
TripoSR: Fast 3D Object Reconstruction from a Single Image
Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…
Official repo for: Epipolar Geometry Improves Video Generation Models
Official repo for ArtiLatent (SIGGRAPH Asia 2025)
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
[SIGGRAPH Asia 2025] PhySIC: Physically Plausible 3D Human-Scene Interaction and Contact from a Single Image
This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"
Trace Anything: Representing Any Video in 4D via Trajectory Fields
Repository of the paper "AnyUp: Universal Feature Upsampling".
[NeurIPS 2025] Pixel-Perfect Depth
[ICCV 2025] SuperDec: 3D Scene Decomposition with Superquadric Primitives.
Official code for "Learning 3D Garment Animation from Trajectories of A Piece of Cloth"
[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers