- Oxford
Highlights
- Pro
Stars
Official repo for DAD-3DHeads: A Large-scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image (CVPR 2022).
Infinite Photorealistic Worlds using Procedural Generation
Official repo for S3OD: Towards Generalizable Salient Object Detection with Synthetic Data
[IV 2025, Oral] Official code of "LiDPM: Rethinking Point Diffusion for Lidar Scene Completion"
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
[NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models
This repository provides the official PyTorch implementation of the paper: MaskFactory: Towards High-quality Synthetic Data Generation For Dichotomous Image Segmentation
This repository is the code of paper 'DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark'.
[ICCV 2025] DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness
Implementation of the paper "DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients"
[CVPR 2024] Shadows Don’t Lie and Lines Can’t Bend! Generative Models don’t know Projective Geometry...for now
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
[NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception
Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset..
[ECCV 2024] Official Repo for: Dataset Enhancement with Instance-Level Augmentations
Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
A playbook for systematically maximizing the performance of deep learning models.
Towards Unified Keyframe Propagation Models
Homography Decomposition Networks for Planar Object Tracking
Official repo for FEAR: Fast, Efficient, Accurate and Robust Visual Tracker (ECCV 2022)
Contains source code for the winning solution of the xView3 challenge https://iuu.xview.us/.
implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)
Official Python toolkit for generic object tracking benchmark GOT-10k and beyond
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image