View wxjiao's full-sized avatar

Public repository for Agent Skills

Python 25,829 2,401 Updated Dec 20, 2025
Jupyter Notebook 169 2 Updated Dec 19, 2025

Sparking "Thinking with Videos" via Reinforcement Learning

Python 1 Updated Oct 30, 2025

Awesome List for Agentic RL

HTML 645 26 Updated Dec 9, 2025

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 476 35 Updated Dec 19, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,058 75 Updated Nov 25, 2025

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python 35 Updated Nov 3, 2025
Python 49 2 Updated Dec 10, 2025

Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.

Python 514 54 Updated Dec 1, 2025

**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.

Python 320 7 Updated Nov 3, 2025

Sparking "Thinking with Videos" via Reinforcement Learning

Python 117 3 Updated Oct 30, 2025

Task-Aware Agent-driven Prompt Optimization Framework

Python 3,719 327 Updated Oct 13, 2025

The development and future prospects of large multimodal reasoning models.

560 20 Updated Aug 2, 2025

MiniMax-M2, a model built for Max coding & agentic workflows.

2,081 160 Updated Nov 13, 2025

🔥 [EMNLP 2025] Official open-source repo for Boosting Multi-modal Keyphrase Prediction with Dynamic Chain-of-Thought in Vision-Language Models

Python 13 Updated Oct 14, 2025

Marco Search Agent for Realistic and Challenging Agentic Search

Python 240 21 Updated Oct 24, 2025

Search Self-Play: Pushing the Frontier of Agent Capability without Supervision

Python 77 7 Updated Nov 13, 2025

DeepAnalyze is the first agentic LLM for autonomous data science. 🎈 Your AI data analyst: automatically analyzes large volumes of data and generates professional analysis reports in one click!

Python 3,292 483 Updated Dec 15, 2025

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python 873 111 Updated Nov 2, 2025
Python 3 2 Updated Dec 19, 2025

MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents

Python 527 65 Updated Dec 23, 2025

Contexts Optical Compression

Python 21,554 1,927 Updated Oct 25, 2025

[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

1,437 92 Updated Oct 11, 2025

Automatic Video Generation from Scientific Papers

Python 2,016 298 Updated Oct 20, 2025

Rubric Reward Model to reduce “miracle steps” and unfaithful CoT in math; SFT+PPO training and verified evaluation.

Python 7 Updated Oct 10, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,760 1,178 Updated Sep 26, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,897 469 Updated Dec 21, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 677 46 Updated Oct 15, 2025

Train your Agent model via our easy and efficient framework

Python 1,668 156 Updated Dec 5, 2025