Highlights
- Pro
Stars
COCO API - Dataset @ http://cocodataset.org/
A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)
verl: Volcano Engine Reinforcement Learning for LLMs
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Pixel-Level Reasoning Model trained with RL [NeuIPS25]
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
OpenThinkIMG is an end-to-end open-source framework that empowers Large Vision-Language Models to think with images.
[IEEE TMI] Unleashing the Power of Intermediate Domains for Mixed Domain Semi-Supervised Medical Image Segmentation
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
A Survey of Reinforcement Learning for Large Reasoning Models
The development and future prospects of large multimodal reasoning models.
[TPAMI 2023] LibFewShot: A Comprehensive Library for Few-shot Learning.
Awesome Unified Multimodal Models
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[CVPR 2025] Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation
Implementation of MedSegDiff in Pytorch - SOTA medical segmentation using DDPM and filtering of features in fourier space
Implementation of The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation
A hex editor for WeChat/QQ/TIM - PC版微信/QQ/TIM防撤回补丁(我已经看到了,撤回也没用了)
The implementation of the technical report: "Customized Segment Anything Model for Medical Image Segmentation"