XiaoYee

🎯

Focusing

Xiaoye Qu XiaoYee

🎯

Focusing

Researcher in Shanghai AI Lab. Research on Model Architecture, Multimodal Reasoning, and Efficient Reasoning.

80 followers · 60 following

Achievements

Highlights

Lists (7)

Sort

Stars

762 results for source starred repositories

Clear filter

lcqysl / VideoSSR

Python 2 Updated Nov 10, 2025

666ghj / BettaFish

微舆：人人可用的多Agent舆情分析助手，打破信息茧房，还原舆情原貌，预测未来走向，辅助决策！从0实现，不依赖任何框架。

Python 24,848 4,750 Updated Nov 10, 2025

Linzwcs / echos

Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.

Python 47 Updated Nov 10, 2025

cambrian-mllm / cambrian-s

Cambrian-S: Towards Spatial Supersensing in Video

Python 225 3 Updated Nov 9, 2025

tongjingqi / Thinking-with-Video

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

151 3 Updated Nov 10, 2025

shaochenze / calm

Official implementation of "Continuous Autoregressive Language Models"

Python 455 56 Updated Nov 10, 2025

RLinf / RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,133 107 Updated Nov 10, 2025

ThinkMorph / ThinkMorph

The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 89 2 Updated Nov 5, 2025

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 377 18 Updated Nov 10, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,197 43 Updated Nov 7, 2025

TheAgentArk / Toucan

Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 166 7 Updated Oct 7, 2025

bingreeky / AgenTracer

AgenTracer: A Lightweight Failure Attributor for Agentic Systems

HTML 57 1 Updated Sep 25, 2025

HKUDS / ViMax

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 642 95 Updated Nov 7, 2025

kowndinya-renduchintala / WIT

On the Effect of Instruction Tuning Loss on Generalization

Python 4 Updated Jul 16, 2025

Haochen-Wang409 / Grasp-Any-Region

Official implementation of "Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs".

Python 89 9 Updated Nov 7, 2025

pprp / Awesome-Efficient-MoE

Efficient Mixture of Experts for LLM Paper List

Python 143 5 Updated Sep 28, 2025

parameterlab / dr-llm

Source code of "Dr.LLM: Dynamic Layer Routing in LLMs"

Python 39 2 Updated Oct 15, 2025

yuzeng0-0 / AGILE

Official Implement of "Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models"

Python 27 Updated Oct 6, 2025

HKUDS / LightAgent

"LightAgent: Lightweight and Cost-Effective Mobile Agents"

Python 37 5 Updated Oct 20, 2025

Gen-Verse / Open-AgentRL

Demystifying Reinforcement Learning in Agentic Reasoning

Python 111 21 Updated Oct 14, 2025

ChnQ / MI-Peaks

Python 55 3 Updated Jul 14, 2025

ulab-uiuc / AgentDebug

Python 51 8 Updated Oct 1, 2025

huaixuheqing / VPPO-RL

Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"

Python 34 4 Updated Nov 7, 2025

lupantech / AgentFlow

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,218 150 Updated Nov 5, 2025

NJU-RL / DIVER

The Official Implementation of DIVER

Python 25 Updated Oct 9, 2025

arcprize / hierarchical-reasoning-model-analysis

Python 155 28 Updated Aug 15, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,810 199 Updated Sep 12, 2025

yunlong10 / Awesome-Video-LMM-Post-Training

🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training

Python 158 9 Updated Oct 28, 2025

thuml / MiniVeo3-Reasoner

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 176 5 Updated Oct 12, 2025

SamsungSAILMontreal / TinyRecursiveModels

Python 5,492 783 Updated Oct 8, 2025

Xiaoye Qu XiaoYee

Highlights

Lists (7)

Competition

GPT-low-resource

HUST

Innovation List

MoE

RLHF

TTS

Stars