-
Fudan University
Stars
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
qqr is an RL training framework for open-ended agents.
[CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
A Clinical Agentic Reasoning Engine to Enhance Real-World Diagnostic Accuracy via Structured Medical Reasoning
CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Learning
A version of verl to support diverse tool use
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical guides on defining and collecting rewards to build more inte…
VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
The Largest-scale Chinese Medical QA Dataset: with 26,000,000 question answer pairs.
RM-R1: Unleashing the Reasoning Potential of Reward Models
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
SigmaFlow is a Python package designed to optimize the performance of task-flow related to LLMs/MLLMs or Multi-agent.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Code repository for the framework to engage in clinical decision making task using the MIMIC-CDM dataset.
[BS]物联网工程,[MS]计算机技术,python,mooc资源,机器学习,深度学习,cryo-em[冷冻电子显微镜],3D reconstruction[三维重建],Computational Vison。