Stars
This is the implementation of official PECNet used for the Reproducibility Challenge 2020
[CVPR2022] Code for CVPR 2022 paper "Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion"
State-of-the-art methods for human trajectory forecasting. Contains code for papers published at ECCV 2020 and ICCV 2021.
[ECCV2022] SocialVAE: Human Trajectory Prediction using Timewise Latents
A dataset for benchmarking 2D Trackers in Minimally Invasive Surgery (MIS)
Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.
Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"
[MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"
[ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment
Official code of the paper "EgoExOR: An Egocentric–Exocentric Operating Room Dataset for Comprehensive Understanding of Surgical Activities" submitted at NeurIPS 2025 Datasets & Benchmarks Track.
[MICCAI 2024] Official dataset release for "EgoSurgery: A Dataset for Surgical Video Understanding from Egocentric Open Surgery Videos"
Official code of the paper LABRAD-OR: Lightweight Memory Scene Graphs for Accurate Bimodal Reasoning in Dynamic Operating Rooms accepted at MICCAI 2023.
Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments accepted at CVPR 2025. This repo includes both the dat…
Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.
Application for classifying out-of-body images in endoscopic videos
Indexity is a web-based tool designed for medical video annotation in surgical data science projects.
Multi-View Operating Room (MVOR) dataset consists of synchronized multi-view frames recorded by three RGB-D cameras in a hybrid OR during real clinical interventions. We provide camera calibration …
There are compilations of surgery-related tasks, datasets, and papers.
Dataset for multi-perspective surgical tool tracking
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…