Stars
Official implementation of "Continuous Autoregressive Language Models"
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
AgenTracer: A Lightweight Failure Attributor for Agentic Systems
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
On the Effect of Instruction Tuning Loss on Generalization
Official implementation of "Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs".
A curated paper list on efficient Mixture-of-Experts (MoE) for LLMs
Source code of "Dr.LLM: Dynamic Layer Routing in LLMs"
Official implementation of "Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models"
"LightAgent: Lightweight and Cost-Effective Mobile Agents"
Demystifying Reinforcement Learning in Agentic Reasoning
Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"
AgentFlow: In-the-Flow Agentic System Optimization
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
🔥🔥🔥 Latest Papers, Code, and Datasets on Video-LMM Post-Training
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star ⭐ if you find it useful.
Official repo for "Self-Forcing++: High-Quality Long Video Generation"
Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowseComp, and xBench.
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.