Stars
AtlasPatch: An Efficient and Scalable Tool for Whole Slide Image Preprocessing
1K resolution vision transformers pretrained on 1B human images.
Official implementation of MuViT (CVPR 2026), a ViT-based architecture for multi-scale modelling of gigapixel microscopy images.
A patient-first foundation model for computational pathology. Two-stage training (self-supervised slide encoding + supervised case-level alignment) on 77K+ public whole-slide images across 333 clin…
A parameter-efficient mixture-of-experts module for computational pathology - ICLR
GenBio-PathFM is a histopathology foundation model from GenBio AI.
[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
[NeurIPS25 D&B Spotlight] A tile-level histopathology image understanding benchmark
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
PooDLe: Pooled and dense self-supervised learning from naturalistic videos
[ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"
[CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning".
Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"
Fast Vision Mamba : Pool your Spatial Dimensions for Accelerated Processing
[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models
Advanced Privacy-Preserving Federated Learning framework
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
Pytorch implementation of paper "Contrastive Learning with Synthetic Positives"