Stars
Wan: Open and Advanced Large-Scale Video Generative Models
Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)
A web-based collaborative LaTeX editor
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
[CVPR 2025] Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation".
This repository is an implementation of the NeurIPS 2024 paper "LoD-Loc: Visual Localization using LoD 3D Map with Neural Wireframe Alignment".
[ICCV 2023] Deep Active Contours for Real-time 6-DoF Object Tracking
OpenXRLab XRAPI is an open-source implementation of the Google ARCore and Apple ARKit
OpenXRLab Multi-Modal Motion Generation Toolbox and Benchmark
OpenXRLab foundational library for XR-related algorithms
OpenXRLab Structure-from-Motion Toolbox and Benchmark
OpenXRLab Visual Localization Toolbox and Server
OpenXRLab Synthetic Data Rendering Toolbox
OpenXRLab Multi-view Motion Capture Toolbox and Benchmark
OpenXRLab Visual-inertial SLAM Toolbox and Benchmark
OpenXRLab Neural Radiance Field (NeRF) Toolbox and Benchmark
Given an input mesh, computes the F-score. This assumes that an appropriate path to the ground truth mesh is available.