XiaoYee

Follow

🎯

Focusing

Xiaoye Qu XiaoYee

🎯

Focusing

Follow

Researcher in Shanghai AI Lab. Research on Model Architecture, Multimodal Reasoning, and Efficient Reasoning.

80 followers · 60 following

Achievements

Achievements

Highlights

Pro

Lists (7)

Sort

Competition

GPT-low-resource

15 repositories

HUST

Innovation List

MoE

15 repositories

RLHF

TTS

Stars

OpenDCAI / SciAgent

SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning

39 2 Updated Nov 13, 2025

WujiangXu / A-mem

The code for NeurIPS 2025 paper "A-MEM: Agentic Memory for LLM Agents"

Python 681 63 Updated Nov 1, 2025

lcqysl / VideoSSR

Python 17 1 Updated Nov 11, 2025

666ghj / BettaFish

微舆：人人可用的多Agent舆情分析助手，打破信息茧房，还原舆情原貌，预测未来走向，辅助决策！从0实现，不依赖任何框架。

Python 26,598 5,081 Updated Nov 13, 2025

Linzwcs / echos

Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.

Python 47 Updated Nov 10, 2025

cambrian-mllm / cambrian-s

Cambrian-S: Towards Spatial Supersensing in Video

Python 311 7 Updated Nov 10, 2025

tongjingqi / Thinking-with-Video

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

176 3 Updated Nov 10, 2025

shaochenze / calm

Official implementation of "Continuous Autoregressive Language Models"

Python 525 64 Updated Nov 10, 2025

RLinf / RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,217 115 Updated Nov 13, 2025

ThinkMorph / ThinkMorph

The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 91 2 Updated Nov 12, 2025

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 387 21 Updated Nov 12, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,228 42 Updated Nov 13, 2025

TheAgentArk / Toucan

Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 169 7 Updated Oct 7, 2025

bingreeky / AgenTracer

AgenTracer: A Lightweight Failure Attributor for Agentic Systems

HTML 58 1 Updated Nov 12, 2025

HKUDS / ViMax

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 657 100 Updated Nov 7, 2025

kowndinya-renduchintala / WIT

On the Effect of Instruction Tuning Loss on Generalization

Python 4 Updated Jul 16, 2025

Haochen-Wang409 / Grasp-Any-Region

Official implementation of "Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs".

Python 92 9 Updated Nov 7, 2025

pprp / Awesome-Efficient-MoE

Efficient Mixture of Experts for LLM Paper List

Python 143 5 Updated Sep 28, 2025

parameterlab / dr-llm

Source code of "Dr.LLM: Dynamic Layer Routing in LLMs"

Python 39 2 Updated Oct 15, 2025

yuzeng0-0 / AGILE

Official Implement of "Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models"

Python 27 Updated Oct 6, 2025

HKUDS / LightAgent

"LightAgent: Lightweight and Cost-Effective Mobile Agents"

Python 39 5 Updated Oct 20, 2025

Gen-Verse / Open-AgentRL

Demystifying Reinforcement Learning in Agentic Reasoning

Python 115 21 Updated Oct 14, 2025

ChnQ / MI-Peaks

Python 55 3 Updated Jul 14, 2025

ulab-uiuc / AgentDebug

Python 54 8 Updated Oct 1, 2025

huaixuheqing / VPPO-RL

Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"

Python 34 4 Updated Nov 11, 2025

lupantech / AgentFlow

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,249 154 Updated Nov 5, 2025

NJU-RL / DIVER

The Official Implementation of DIVER

Python 25 Updated Oct 9, 2025

arcprize / hierarchical-reasoning-model-analysis

Python 155 28 Updated Aug 15, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,823 200 Updated Sep 12, 2025

yunlong10 / Awesome-Video-LMM-Post-Training

🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training

Python 161 9 Updated Oct 28, 2025