Stars
[ICCV 2025] MMReason, MLLMs, step by step, reasoning benchmark, AGI
[NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and DeepSeek-R1
Empowering MLLM for Grounded ECG Understanding with Time Series and Images [NeurIPS 2025]
[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasonning techniques, chain-of-thought (COT), Large Language Model, Straberry
DBPM is a simple algorithm designed as a lightweight plug-in without learnable parameters to enhance the performance of time series contrastive learning.
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Collection of AWESOME vision-language models for vision tasks