Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
verl: Volcano Engine Reinforcement Learning for LLMs
Pixel-Level Reasoning Model trained with RL [NeurIPS25]
A Survey of Reinforcement Learning for Large Reasoning Models
GitHub Pages template based on HTML and Markdown for personal, portfolio-based websites.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
[TPAMI 2023] LibFewShot: A Comprehensive Library for Few-shot Learning.
This repository provides a valuable reference for researchers in the field of multimodality. Start exploring RL-based Reasoning MLLMs here!
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)
A hex editor for WeChat/QQ/TIM - an anti-recall patch for the PC versions of WeChat/QQ/TIM (I have already seen it, so recalling it is useless)
Awesome Unified Multimodal Models
The development and future prospects of large multimodal reasoning models.
OpenThinkIMG is an end-to-end open-source framework that empowers Large Vision-Language Models to think with images.
[IEEE TMI] Unleashing the Power of Intermediate Domains for Mixed Domain Semi-Supervised Medical Image Segmentation
[CVPR 2025] Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation
Semi-Supervised Learning for Medical Image Segmentation, a collection of literature reviews and code implementations.
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
Implementation of The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation
[CVPR 2024] Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[CVPR 2023] Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.