
[ICLR 2026] Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)

Python 821 78 Updated Feb 4, 2026

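A carefully curated repository of Mihomo (Clash Meta) configuration files that syncs high-quality upstream rules daily via GitHub Actions and provides a complete solution from beginner to advanced.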

Shell 2,083 212 Updated Jun 19, 2026

Python 5 Updated Jan 13, 2026

An Open Flexible Quadrotor Simulator

C++ 1,377 401 Updated Jun 14, 2024

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,034 4,095 Updated Jun 18, 2026

A Survey of Self-Evolving Agents | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Self-Evolving Agents.

262 17 Updated Jun 7, 2026

A benchmark environment for fully cooperative human-AI performance.

Jupyter Notebook 979 220 Updated Mar 22, 2025

[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

Python 941 102 Updated Nov 16, 2025

The NetHack Learning Environment

C 984 130 Updated May 6, 2024

An Open-Ended Embodied Agent with Large Language Models

JavaScript 6,991 680 Updated Apr 3, 2024

A comparison of the actual value of each vendor's Coding Plan

1,723 28 Updated Jun 17, 2026

Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A Perspective

76 1 Updated Feb 9, 2026

[ICML 2026 Spotlight] Latent Collaboration in Multi-Agent Systems

Python 994 159 Updated Jun 18, 2026

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 17,229 1,954 Updated Jun 17, 2026

A programming framework for agentic AI

Python 59,061 8,908 Updated Apr 15, 2026

Chinese-language documentation for building agents with LangGraph

211 21 Updated Nov 18, 2024

Comprehensive tutorials for LangChain, LangGraph, and LangSmith using Groq LLM. Learn to build advanced AI systems, from basics to production-ready applications. Covers key concepts, real-world exa…

Jupyter Notebook 81 18 Updated May 6, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 2,953 3,498 Updated Apr 22, 2026

Recommendations for reliable paid proxy subscription services ("airports") in 2026

2,566 126 Updated May 16, 2026

LangGraph template for a simple ReAct agent

Python 773 690 Updated Jun 7, 2026

Demystify AI agents by building them yourself. Local LLMs, no black boxes, real understanding of function calling, memory, and ReAct patterns.

JavaScript 4,284 619 Updated May 31, 2026

This repo compiles a collection of examples that demonstrate the effective use of the ReAct pattern in LLM prompting. It includes variations and implementations of agents that leverage the ReAct pa…

Python 166 54 Updated May 30, 2025

A longitudinal reliability benchmark foundation for agent lifespan engineering.

Python 14 Updated Jun 5, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Python 133,234 21,543 Updated Jun 19, 2026

Whale — blazingly fast, terminal-first AI coding agent for DeepSeek. ~98% prompt cache hit rate, 1M context, MCP tools, dynamic workflows.

Go 662 44 Updated Jun 18, 2026

Hierarchical Reasoning Model Official Release

Python 12,548 1,829 Updated Mar 31, 2026

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,948 372 Updated Jun 3, 2026

Official PyTorch Implementation of Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Python 220 17 Updated May 25, 2026

Source code and data in paper "MDFEND: Multi-domain Fake News Detection (CIKM'21)"

Python 247 40 Updated Nov 23, 2022