A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparascopic Video

Maximilian Fehrentz*^1,2,4 · Nicolas Stellwag*² · Robert Wiebe² · Nicole Thorisch² · Fabian Grob² · Patrick Remerscheid² · Ken-Joel Simmoteit² · Benjamin D. Killeen^1,4 · Christian Heiliger³ · Nassir Navab^1,4

¹Computer Aided Medical Procedures, TU Munich

²TUM.ai

³University Hospital of Ludwig Maximilian University (LMU) Munich

⁴Munich Center for Machine Learning

Project Page | Paper | Dataset

Official implementation of "A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparascopic Video".

Installation & Setup

1. Install pixi

Install the pixi package manger using the official instructions.

Note: This installs pixi into the user's home directory and python environments will later be placed in this project directory. So there should be no issues on compute clusters.

2. Clone repository

git clone --recurse-submodules git@github.com:tum-ai/surg4d.git

3. Setup python environment

# install conda and pypi packages
pixi install

# install custom packages and download checkpoints
pixi run setup

# optional: test importing key packages
pixi run test-install

4. Download dataset and annotations

Download the CholecSeg8k dataset and our annotations:

pixi run download-cholecseg8k
pixi run download-benchmark-annotations

Note: If you want to annotate your own queries, check out our annotation tool repository.

Usage

The pipeline is based on the configuration system hydra. Config files can be found in conf/. Check out the hydra getting started guide.

Run the pipeline using the following scripts:

# train segmentation model and create masks
pixi run python segment.py

# preprocess frames, masks, and annotations
pixi run python preprocess.py

# predict depth and pose
pixi run python extract_geometry.py

# create temporally consistent instances
pixi run python track_objects.py

# build 4d scene graphs
pixi run python extract_graphs.py

# predict benchmark queries
pixi run python evaluate_benchmark.py

# compute benchmark metrics
pixi run python compute_metrics.py

Or the whole pipeline, including all ablations:

bash ablate_all.sh

📧 Contact

For questions, please open an issue or contact maximilian.fehrentz@tum.de.

Name		Name	Last commit message	Last commit date
Latest commit History 324 Commits
benchmark		benchmark
conf		conf
llm		llm
project_page		project_page
submodules		submodules
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
ablate_all.sh		ablate_all.sh
compute_metrics.py		compute_metrics.py
evaluate_benchmark.py		evaluate_benchmark.py
extract_geometry.py		extract_geometry.py
extract_graphs.py		extract_graphs.py
index.html		index.html
pixi.lock		pixi.lock
pixi.toml		pixi.toml
preprocess.py		preprocess.py
segment.py		segment.py
track_objects.py		track_objects.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparascopic Video

Project Page | Paper | Dataset

Installation & Setup

1. Install pixi

2. Clone repository

3. Setup python environment

4. Download dataset and annotations

Usage

📧 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparascopic Video

Project Page | Paper | Dataset

Installation & Setup

1. Install pixi

2. Clone repository

3. Setup python environment

4. Download dataset and annotations

Usage

📧 Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages