A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

693 16 Updated Nov 7, 2025

SIBench / Awesome-Visual-Spatial-Reasoning

This is a project about visual spatial reasoning.

HTML 76 1 Updated Oct 31, 2025

Tencent-Hunyuan / Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,303 1,208 Updated Oct 28, 2025

MC-E / DragonDiffusion

ICLR 2024 (Spotlight)

Python 776 21 Updated Mar 2, 2024

Kobaayyy / Awesome-CVPR2025-CVPR2024-CVPR2021-CVPR2020-Low-Level-Vision

A Collection of Papers and Codes for CVPR2025/CVPR2024/CVPR2021/CVPR2020 Low Level Vision

1,513 151 Updated Jul 24, 2025

arijitray1993 / awesome-spatial-reasoning

Collection of the latest spatial, 3D, and video/temporal reasoning papers

25 1 Updated Sep 29, 2025

lif314 / Awesome-Spatial-Intelligence

Awesome Spatial Intelligence (Personal Use)

29 1 Updated Jul 4, 2025

Hoyyyaard / 3DFlowAction

Python 38 3 Updated Jul 6, 2025

oDaiSuno / ScholAI

Python 217 10 Updated Jun 25, 2025

FengheTan9 / LLM4Seg

[MICCAI 2025] Official code for "Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster"

Python 40 4 Updated Oct 4, 2025

brown-palm / force-prompting

Official implementation of "Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals" (NeurIPS 2025)

Python 132 3 Updated Sep 27, 2025

HL-hanlin / Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)

Python 462 15 Updated Feb 11, 2025

xuzhang0112 / GKI-ICD

Offical Code for Paper "A General Knowledge Injection Framework for ICD Coding" (ACL 2025 Findings)

Jupyter Notebook 11 1 Updated Jun 10, 2025

FengheTan9 / CMU-Net

[ISBI 2023] Official Pytorch implementation of "CMU-Net: A Strong ConvMixer-based Medical Ultrasound Image Segmentation Network"

Python 87 6 Updated Dec 13, 2024

FengheTan9 / HySparK

[MICCAI 2024] HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training

Python 21 Updated Nov 17, 2024

FengheTan9 / MambaMIM

[MedIA 2025] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation

Python 37 5 Updated Aug 10, 2025

FengheTan9 / CMUNeXt

[ISBI 2024 Oral] Official Pytorch Code base for "CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion"

Python 114 12 Updated Dec 2, 2024

FengheTan9 / Medical-Image-Segmentation-Benchmarks

A Pytorch implement of medical image segmentation U-shape architecture benchmarks

Python 119 5 Updated Aug 6, 2025

ToniChopp / ECAMP

The official implementation of "ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training"

Python 44 2 Updated Oct 16, 2025

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,008 1,079 Updated Nov 18, 2024

Mwxinnn / AA-CLIP

The official implementation of AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP

Python 181 11 Updated May 26, 2025

MengyuanChen21 / Awesome-Evidential-Deep-Learning

A curated publication list on evidential deep learning.

144 9 Updated Apr 16, 2025

FouierL / Di-Fusion

[ICLR 2025] Official code of Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement

Python 23 Updated Oct 30, 2025

FengheTan9 / Hi-End-MAE

[MedIA 2025] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation

Python 22 4 Updated Oct 31, 2025