Stars
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Optimize prompts, code, and more with AI-powered Reflective Text Evolution
This is an unofficial AI implementation of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents
[ICLR 2025] Automated Design of Agentic Systems
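[ICLR 2026] Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
A carefully curated repository of Mihomo (Clash Meta) configuration files that automatically syncs high-quality upstream rules every day via GitHub Actions, providing a complete solution from beginner to advanced.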
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
A Survey of Self-Evolving Agents | A curated list of resources (surveys, papers, benchmarks, and open-source projects) on Self-Evolving Agents.
A benchmark environment for fully cooperative human-AI performance.
[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.
An Open-Ended Embodied Agent with Large Language Models
Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A Perspective
[ICML 2026 Spotlight] Latent Collaboration in Multi-Agent Systems
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Comprehensive tutorials for LangChain, LangGraph, and LangSmith using Groq LLM. Learn to build advanced AI systems, from basics to production-ready applications. Covers key concepts, real-world exa…
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
LangGraph template for a simple ReAct agent
Demystify AI agents by building them yourself. Local LLMs, no black boxes, real understanding of function calling, memory, and ReAct patterns.
This repo compiles a collection of examples that demonstrate the effective use of the ReAct pattern in LLM prompting. It includes variations and implementations of agents that leverage the ReAct pa…
A longitudinal reliability benchmark foundation for agent lifespan engineering.