Stars
[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
一个精心整理的 Mihomo (Clash Meta) 配置文件仓库,通过 GitHub Actions 每日自动同步上游优质规则,提供从入门到进阶的完整解决方案。
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
A Survey of Self-Evolving Agents | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Self-Evolving Agents.
A benchmark environment for fully cooperative human-AI performance.
[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.
An Open-Ended Embodied Agent with Large Language Models
Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A Perspective
[ICML 2026 Spotlight] Latent Collaboration in Multi-Agent Systems
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Comprehensive tutorials for LangChain, LangGraph, and LangSmith using Groq LLM. Learn to build advanced AI systems, from basics to production-ready applications. Covers key concepts, real-world exa…
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
LangGraph template for a simple ReAct agent
Demystify AI agents by building them yourself. Local LLMs, no black boxes, real understanding of function calling, memory, and ReAct patterns.
This repo compiles a collection of examples that demonstrate the effective use of the ReAct pattern in LLM prompting. It includes variations and implementations of agents that leverage the ReAct pa…
A longitudinal reliability benchmark foundation for agent lifespan engineering.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Whale — blazingly fast, terminal-first AI coding agent for DeepSeek. ~98% prompt cache hit rate, 1M context, MCP tools, dynamic workflows.
Hierarchical Reasoning Model Official Release
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Official PyTorch Implementation of Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention
Source code and data in paper "MDFEND: Multi-domain Fake News Detection (CIKM'21)"