Starred repositories
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
UltraRAG v2: A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
12 Lessons to Get Started Building AI Agents
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
A live-streamed development of RL tuning for LLM agents
No fortress, purely open ground. OpenManus is Coming.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Build resilient language agents as graphs.
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
[EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.
✨✨Latest Advances on Multimodal Large Language Models
Task-Aware Agent-driven Prompt Optimization Framework
Implementations of Reinforcement Learning and Planning algorithms
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
GPT4V-level open-source multi-modal model based on Llama3-8B
Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)
Chinese Vision-Language Understanding Evaluation
Model Context Protocol Servers
Official implementation of paper "Optimizing Decomposition for Optimal Claim Verification"
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄), available to read online at https://datawhalechina.github.io/easy-rl/
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
verl: Volcano Engine Reinforcement Learning for LLMs
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL