Skip to content
View kduxin's full-sized avatar

Highlights

  • Pro

Block or report kduxin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3 2 Updated Sep 12, 2025

[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Python 220 24 Updated Apr 30, 2026

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 5,306 438 Updated Jun 23, 2026

This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python 43 3 Updated Nov 9, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 2,129 429 Updated Aug 13, 2025

[ICLR 2025] Automated Design of Agentic Systems

Python 1,597 239 Updated Jan 28, 2025

[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)

Python 822 78 Updated Feb 4, 2026

一个精心整理的 Mihomo (Clash Meta) 配置文件仓库,通过 GitHub Actions 每日自动同步上游优质规则,提供从入门到进阶的完整解决方案。

Shell 2,136 215 Updated Jun 23, 2026
Python 5 Updated Jan 13, 2026

An Open Flexible Quadrotor Simulator

C++ 1,380 401 Updated Jun 14, 2024

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,093 4,110 Updated Jun 23, 2026

A Survey of Self-Evolving Agents | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Self-Evolving Agents.

270 18 Updated Jun 7, 2026

A benchmark environment for fully cooperative human-AI performance.

Jupyter Notebook 981 219 Updated Mar 22, 2025

[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

Python 942 102 Updated Nov 16, 2025

The NetHack Learning Environment

C 984 129 Updated May 6, 2024

An Open-Ended Embodied Agent with Large Language Models

JavaScript 6,996 681 Updated Apr 3, 2024

各厂家 Coding Plan 实际价值对比

1,827 29 Updated Jun 17, 2026

Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A Perspective

76 1 Updated Feb 9, 2026

[ICML 2026 Spotlight] Latent Collaboration in Multi-Agent Systems

Python 1,001 160 Updated Jun 18, 2026

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 17,249 1,965 Updated Jun 17, 2026

A programming framework for agentic AI

Python 59,179 8,922 Updated Apr 15, 2026

LangGraph创建agent的中文文档

212 20 Updated Nov 18, 2024

Comprehensive tutorials for LangChain, LangGraph, and LangSmith using Groq LLM. Learn to build advanced AI systems, from basics to production-ready applications. Covers key concepts, real-world exa…

Jupyter Notebook 81 18 Updated May 6, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 2,980 3,520 Updated Apr 22, 2026

2026 好用的付费机场推荐

2,629 129 Updated May 16, 2026

LangGraph template for a simple ReAct agent

Python 776 690 Updated Jun 20, 2026

Demystify AI agents by building them yourself. Local LLMs, no black boxes, real understanding of function calling, memory, and ReAct patterns.

JavaScript 4,298 622 Updated May 31, 2026

This repo compiles a collection of examples that demonstrate the effective use of the ReAct pattern in LLM prompting. It includes variations and implementations of agents that leverage the ReAct pa…

Python 167 54 Updated May 30, 2025

A longitudinal reliability benchmark foundation for agent lifespan engineering.

Python 14 Updated Jun 5, 2026
Next