Lists (2)
Sort Name ascending (A-Z)
Stars
Masked Omics Modeling for Multimodal Representation Learning across Histopathology and Molecular Profiles
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Official implementation of GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)
kabachuha / OpenMMDiT
Forked from NUS-HPC-AI-Lab/VideoSysOpen(MM)DiT: An Easy, Fast and Memory-Efficient System for (MM)DiT Training and Inference
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
A comprehensive codebase for training and finetuning Image <> Latent models.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"
A markup-based typesetting system that is powerful and easy to learn.
Toy implementation simplified consistency models using the TrigFlow formulation.
PETPrep: A Robust Preprocessing Pipeline for PET Data
Official implementation of "SynthBA: Reliable Brain Age Estimation Across Multiple MRI Sequences and Resolutions".
Synthstrip integration to be used across nipreps
Repository of the paper "AnyUp: Universal Feature Upsampling".
BM-MAE: Multimodal Masked Autoencoder Pre-training for 3D MRI-based Brain Tumor Analysis with Missing Modalities
Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.
This is a PyTorch implementation of BrainMVP for mpMRI brain image analysis.
Code for MM-DINOv2: Adapting Foundation Models for Multi-Modal Medical Image Analysis (MICCAI2025)