A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

692 15 Updated Nov 7, 2025

HL-hanlin / Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)

Python 462 15 Updated Feb 11, 2025

oDaiSuno / ScholAI

Python 217 10 Updated Jun 25, 2025

Mwxinnn / AA-CLIP

The official implementation of AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP

Python 181 11 Updated May 26, 2025

MengyuanChen21 / Awesome-Evidential-Deep-Learning

A curated publication list on evidential deep learning.

144 9 Updated Apr 16, 2025

FengheTan9 / U-Bench

U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking

Python 132 16 Updated Nov 6, 2025

brown-palm / force-prompting

Official implementation of "Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals" (NeurIPS 2025)

Python 132 3 Updated Sep 27, 2025

FengheTan9 / Medical-Image-Segmentation-Benchmarks

A Pytorch implement of medical image segmentation U-shape architecture benchmarks

Python 118 5 Updated Aug 6, 2025

FengheTan9 / CMUNeXt

[ISBI 2024 Oral] Official Pytorch Code base for "CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion"

Python 114 12 Updated Dec 2, 2024

FengheTan9 / CMU-Net

[ISBI 2023] Official Pytorch implementation of "CMU-Net: A Strong ConvMixer-based Medical Ultrasound Image Segmentation Network"

Python 87 6 Updated Dec 13, 2024

SIBench / Awesome-Visual-Spatial-Reasoning

This is a project about visual spatial reasoning.

HTML 76 1 Updated Oct 31, 2025

XOR-op / BoltConn

Privacy-oriented proxy & network manager, supporting WireGuard, L7 firewall, App-based policies and scripted MitM.

Rust 64 4 Updated Oct 2, 2025

ToniChopp / ECAMP

The official implementation of "ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training"

Python 44 2 Updated Oct 16, 2025

FengheTan9 / LLM4Seg

[MICCAI 2025] Official code for "Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster"

Python 40 4 Updated Oct 4, 2025

Hoyyyaard / 3DFlowAction

Python 38 3 Updated Jul 6, 2025

FengheTan9 / MambaMIM

[MedIA 2025] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation

Python 35 5 Updated Aug 10, 2025

lif314 / Awesome-Spatial-Intelligence

Awesome Spatial Intelligence (Personal Use)

29 1 Updated Jul 4, 2025

arijitray1993 / awesome-spatial-reasoning

Collection of the latest spatial, 3D, and video/temporal reasoning papers

25 1 Updated Sep 29, 2025