- Shenzhen, China
- https://dodoxxb.github.io/
Stars
Official implementation of MatterGen -- a generative model for inorganic materials design across the periodic table that can be fine-tuned to steer the generation towards a wide range of property c…
[CVPR 2026] The official PyTorch implementation of "Vision Transformer Needs More Than Registers".
CVPR 2025, EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection
Official code for CVPR2026 "PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation"
A collection of spectral descriptors for 3D meshed surfaces
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
[ICCV 2023] Consistent Image Synthesis and Editing
GrapHist: Graph Self-Supervised Learning for Histopathology
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd. Built in Rust using oh-my-codex.
The hub for EleutherAI's work on interpretability and learning dynamics
Study analyzing how weight tying affects the relationship between input (embedding) and output (unembedding) matrices in language models.
Implementing C-RADIOv4 as a Remote Source Zoo Model for FiftyOne
[CVPR 2026] - IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]
Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"
[Nature Communications 2026] A universal foundation model for grounded biomedical image interpretation
UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling
[MICCAI 2025 Early Accept] PRETI: Patient-Aware Retinal Foundation Model via Metadata-Guided Representation Learning
Official codebase for RetFiner: A Vision-Language Refinement Scheme for Retinal Foundation Models
[IEEE TPAMI 2025] This repository is the official implementation of the paper "VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge"
Versatile and Open Large Models for Ophthalmology
A topic-centric list of HQ open datasets.
This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” (ModelBest's "little steel cannon") focuses on achieving exceptional performance on the edge.
Identification and Validation of Robust Prognostic Biomarkers and Signatures in Solid Tumors
Comparing text compression performance of LLMs with traditional compression approaches.