Skip to content
View XiaoYee's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report XiaoYee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning

39 2 Updated Nov 13, 2025

The code for NeurIPS 2025 paper "A-MEM: Agentic Memory for LLM Agents"

Python 681 63 Updated Nov 1, 2025
Python 17 1 Updated Nov 11, 2025

微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。

Python 26,598 5,081 Updated Nov 13, 2025

Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.

Python 47 Updated Nov 10, 2025

Cambrian-S: Towards Spatial Supersensing in Video

Python 311 7 Updated Nov 10, 2025

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

176 3 Updated Nov 10, 2025

Official implementation of "Continuous Autoregressive Language Models"

Python 525 64 Updated Nov 10, 2025

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,217 115 Updated Nov 13, 2025

The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 91 2 Updated Nov 12, 2025

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 387 21 Updated Nov 12, 2025

Native Multimodal Models are World Learners

Python 1,228 42 Updated Nov 13, 2025

Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 169 7 Updated Oct 7, 2025

AgenTracer: A Lightweight Failure Attributor for Agentic Systems

HTML 58 1 Updated Nov 12, 2025

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 657 100 Updated Nov 7, 2025

On the Effect of Instruction Tuning Loss on Generalization

Python 4 Updated Jul 16, 2025

Official implementation of "Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs".

Python 92 9 Updated Nov 7, 2025

Efficient Mixture of Experts for LLM Paper List

Python 143 5 Updated Sep 28, 2025

Source code of "Dr.LLM: Dynamic Layer Routing in LLMs"

Python 39 2 Updated Oct 15, 2025

Official Implement of "Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models"

Python 27 Updated Oct 6, 2025

"LightAgent: Lightweight and Cost-Effective Mobile Agents"

Python 39 5 Updated Oct 20, 2025

Demystifying Reinforcement Learning in Agentic Reasoning

Python 115 21 Updated Oct 14, 2025
Python 55 3 Updated Jul 14, 2025
Python 54 8 Updated Oct 1, 2025

Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"

Python 34 4 Updated Nov 11, 2025

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,249 154 Updated Nov 5, 2025

The Official Implementation of DIVER

Python 25 Updated Oct 9, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,823 200 Updated Sep 12, 2025

🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training

Python 161 9 Updated Oct 28, 2025
Next