Stars
High-Resolution Image Synthesis with Latent Diffusion Models
Code release for NeRF (Neural Radiance Fields)
A unified framework for 3D content generation.
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
[ICCV 2019] Monocular depth estimation from a single image
This project reproduces the book Dive Into Deep Learning (https://d2l.ai/), adapting the code from MXNet into PyTorch.
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
High Quality Monocular Depth Estimation via Transfer Learning
Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains
KITTI Object Visualization (Birdview, Volumetric LiDar point cloud )
Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral
D2-Net: A Trainable CNN for Joint Description and Detection of Local Features
The ApolloScape Open Dataset for Autonomous Driving and its Application.
A Scalable Pipeline for Making Steerable Multi-Task Mid-Level Vision Datasets from 3D Scans [ICCV 2021]
Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"
【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
A PyTorch implementation of Connected Components Labeling
My Computer Vision project from my Computer Vision Course (Fall 2020) at Goethe University Frankfurt, Germany. Performance comparison between state-of-the-art Object Detection algorithms YOLO and F…