Multi-instrument foundation model for large-scale heliophysics with SDO data.
Training backend for Cell Observatory models
Physiology-Augmented Chest X-Ray Masked Autoencoder
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
🌌 Applying artificial intelligence on gravitational lensing 🪐
MV-MAE is a hierarchical video model that leverages motion vectors and I-frames from compressed videos to efficiently learn masked motion representations for accurate UAV action recognition.
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
[MedIA 2026] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation
InfoMAE: Pair-Efficient Cross-Modal Alignment for Multimodal Time-Series Sensing
Masked Autoencoder Pretraining on 3D Brain MRI
Deep learning models for 3D volumetric ink detection on ancient Vesuvius scroll fragments.
Investigate possibilities for Vision Transformers with multiscale grids
[WACV'25] Official implementation of "PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices".
[arXiv preprint] 🌊CascadeFormer: A Family of Two-stage Cascading Transformers for Skeleton-based Human Action Recognition
An MAE-based self-supervised setup for aortic valve detection. The model is pretrained for 400 epochs with a high masking ratio to avoid overfitting, and the resulting encoder features are then used within a YOLO-based pipeline for downstream valve detection.
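This description follows a common pattern: pretrain an MAE encoder self-supervised, then reuse the frozen encoder features inside a detection pipeline. Below is a minimal, illustrative PyTorch sketch of that hand-off; the `TinyEncoder` class, the commented-out checkpoint path, and the head dimensions are hypothetical stand-ins, not the repository's actual code.

```python
# Hedged sketch: reusing a self-supervised MAE encoder for a downstream detector.
# Class names, checkpoint path, and dimensions are illustrative, not the repo's API.
import torch
import torch.nn as nn

class TinyEncoder(nn.Module):
    """Stand-in for an MAE-pretrained ViT-style encoder."""
    def __init__(self, embed_dim: int = 192):
        super().__init__()
        self.proj = nn.Conv2d(3, embed_dim, kernel_size=16, stride=16)  # patch embedding
        self.blocks = nn.Sequential(nn.Linear(embed_dim, embed_dim), nn.GELU())

    def forward(self, x):
        tokens = self.proj(x).flatten(2).transpose(1, 2)  # (B, num_patches, embed_dim)
        return self.blocks(tokens)

encoder = TinyEncoder()
# state = torch.load("mae_pretrained_encoder.pt")  # hypothetical checkpoint path
# encoder.load_state_dict(state)

# Freeze the pretrained features and train only the detection head downstream.
for p in encoder.parameters():
    p.requires_grad = False

detection_head = nn.Linear(192, 5)  # toy head: 4 box coordinates + 1 objectness score
features = encoder(torch.randn(2, 3, 224, 224))
print(detection_head(features).shape)  # torch.Size([2, 196, 5])
```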
Salient Object Detection for Video Masked Auto-Encoders
Latent Diffusion Models with Masked AutoEncoders (LDMAE) official code
[TGRS 2024] PEMAE: Pixel-Wise Ensembled Masked Autoencoder for Multispectral Pan-Sharpening
This repo reproduces key findings from Masked Autoencoders Are Scalable Vision Learners (MAE) on CIFAR-10: self-supervised pretraining improves downstream classification versus training from scratch, and it studies how decoder depth and width affect both MAE pretraining and downstream results.
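For reference, the mechanism such reproductions ablate is MAE's random patch masking (a lightweight decoder then reconstructs the masked patches). The sketch below shows only the masking step in plain PyTorch; shapes are sized for CIFAR-10-style inputs and the function name is illustrative, not taken from the repository.

```python
# Minimal sketch of MAE-style random patch masking (illustrative, not the repo's code).
import torch

def random_masking(patch_tokens: torch.Tensor, mask_ratio: float = 0.75):
    """Keep a random subset of patch tokens, as in MAE pretraining.

    patch_tokens: (batch, num_patches, embed_dim)
    Returns the visible tokens, a binary mask (1 = masked), and the
    indices needed to restore the original patch order for the decoder.
    """
    b, n, d = patch_tokens.shape
    n_keep = int(n * (1.0 - mask_ratio))

    noise = torch.rand(b, n, device=patch_tokens.device)  # per-patch random scores
    ids_shuffle = torch.argsort(noise, dim=1)              # random permutation
    ids_restore = torch.argsort(ids_shuffle, dim=1)        # inverse permutation

    ids_keep = ids_shuffle[:, :n_keep]
    visible = torch.gather(patch_tokens, 1, ids_keep.unsqueeze(-1).expand(-1, -1, d))

    mask = torch.ones(b, n, device=patch_tokens.device)
    mask[:, :n_keep] = 0
    mask = torch.gather(mask, 1, ids_restore)              # back to original patch order
    return visible, mask, ids_restore

# Example: CIFAR-10-sized input (32x32 image, 4x4 patches -> 64 patches), toy embed dim.
tokens = torch.randn(8, 64, 192)
visible, mask, ids_restore = random_masking(tokens, mask_ratio=0.75)
print(visible.shape, mask.sum(dim=1))  # (8, 16, 192) and 48 masked patches per image
```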