Skip to content
View Mwxinnn's full-sized avatar

Highlights

  • Pro

Block or report Mwxinnn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

2,026 122 Updated Oct 27, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 682 41 Updated Nov 4, 2025

U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking

Python 132 16 Updated Nov 2, 2025
Python 14,049 1,337 Updated Oct 20, 2025

[SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Python 763 34 Updated Jun 9, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,528 1,183 Updated Oct 11, 2025

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

683 15 Updated Nov 5, 2025

This is a project about visual spatial reasoning.

HTML 76 1 Updated Oct 31, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,275 1,201 Updated Oct 28, 2025

ICLR 2024 (Spotlight)

Python 776 21 Updated Mar 2, 2024

A Collection of Papers and Codes for CVPR2025/CVPR2024/CVPR2021/CVPR2020 Low Level Vision

1,510 151 Updated Jul 24, 2025

Collection of the latest spatial, 3D, and video/temporal reasoning papers

25 1 Updated Sep 29, 2025

Awesome Spatial Intelligence (Personal Use)

29 1 Updated Jul 4, 2025
Python 37 3 Updated Jul 6, 2025
Python 217 10 Updated Jun 25, 2025

[MICCAI 2025] Official code for "Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster"

Python 40 4 Updated Oct 4, 2025

Official implementation of "Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals" (NeurIPS 2025)

Python 132 3 Updated Sep 27, 2025

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)

Python 462 15 Updated Feb 11, 2025

Offical Code for Paper "A General Knowledge Injection Framework for ICD Coding" (ACL 2025 Findings)

Jupyter Notebook 11 1 Updated Jun 10, 2025

[ISBI 2023] Official Pytorch implementation of "CMU-Net: A Strong ConvMixer-based Medical Ultrasound Image Segmentation Network"

Python 87 6 Updated Dec 13, 2024

[MICCAI 2024] HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training

Python 21 Updated Nov 17, 2024

[MedIA 2025] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation

Python 35 5 Updated Aug 10, 2025

[ISBI 2024 Oral] Official Pytorch Code base for "CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion"

Python 114 12 Updated Dec 2, 2024

A Pytorch implement of medical image segmentation U-shape architecture benchmarks

Python 118 5 Updated Aug 6, 2025

The official implementation of "ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training"

Python 43 2 Updated Oct 16, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,000 1,079 Updated Nov 18, 2024

The official implementation of AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP

Python 180 11 Updated May 26, 2025

A curated publication list on evidential deep learning.

144 9 Updated Apr 16, 2025

[ICLR 2025] Official code of Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement

Python 23 Updated Oct 30, 2025

[MedIA 2025] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation

Python 22 4 Updated Oct 31, 2025
Next