Skip to content
View SII-zyj's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Fudan University

Block or report SII-zyj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 750 33 Updated Mar 20, 2026

qqr is an RL training framework for open-ended agents.

Python 228 20 Updated Mar 25, 2026

[CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Python 215 13 Updated Mar 27, 2026

A Clinical Agentic Reasoning Engine to Enhance Real-World Diagnostic Accuracy via Structured Medical Reasoning

Python 4 Updated Dec 8, 2025

CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Learning

Python 127 1 Updated Dec 1, 2025

✨✨ [ICLR 2026] Think Beyond Images

Python 594 35 Updated Sep 23, 2025

A version of verl to support diverse tool use

Python 933 78 Updated Mar 2, 2026

Awesome List for Agentic RL

HTML 899 39 Updated Mar 24, 2026

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 414 17 Updated Jan 29, 2026

A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical guides on defining and collecting rewards to build more inte…

59 Updated Sep 1, 2025

VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning

Python 330 15 Updated Feb 9, 2026

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

Jupyter Notebook 358 8 Updated Jun 1, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,393 42 Updated Mar 9, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,406 1,306 Updated Mar 29, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,780 364 Updated Mar 26, 2026

The Largest-scale Chinese Medical QA Dataset: with 26,000,000 question answer pairs.

322 31 Updated Mar 14, 2024
Python 1,171 72 Updated Nov 20, 2025

RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 161 15 Updated Jun 26, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,089 117 Updated Jul 29, 2024

SigmaFlow is a Python package designed to optimize the performance of task-flow related to LLMs/MLLMs or Multi-agent.

Python 3 Updated Mar 2, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,734 674 Updated Mar 28, 2026

Code repository for the framework to engage in clinical decision making task using the MIMIC-CDM dataset.

Python 49 10 Updated Feb 7, 2025

[BS]物联网工程,[MS]计算机技术,python,mooc资源,机器学习,深度学习,cryo-em[冷冻电子显微镜],3D reconstruction[三维重建],Computational Vison。

19 3 Updated Sep 23, 2019