Skip to content
View fzp0424's full-sized avatar

Block or report fzp0424

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome Agent Environments

14 Updated Apr 10, 2026

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 59,244 4,927 Updated Apr 10, 2026

Learn Claude Code — 基于源码的完整技术分析文档集,15章深度解析 Agent Loop、工具系统、权限系统等核心机制

HTML 20 8 Updated Mar 31, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,466 284 Updated Mar 27, 2026

Roadmap of learning blockchain technology and business knowledge summarized by ZJUBCA(浙大区块链协会总结的区块链知识学习路线)

1,378 208 Updated Jan 9, 2025

A Simple and Universal Swarm Intelligence Engine, Predicting Anything. 简洁通用的群体智能引擎,预测万物

Python 53,397 7,999 Updated Apr 2, 2026

CL-bench: A Benchmark for Context Learning

Python 501 27 Updated Feb 8, 2026

A memory OS that makes your agents more personal while saving tokens.

Python 3,702 394 Updated Apr 10, 2026

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,598 217 Updated Apr 9, 2026

🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!

Python 46,470 5,729 Updated Apr 10, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,406 539 Updated Apr 10, 2026

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 984 69 Updated Jul 31, 2025

[COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

Python 131 7 Updated Mar 19, 2026

AI memory OS for LLM and Agent systems(moltbot,clawdbot,openclaw), enabling persistent Skill memory for cross-task skill reuse and evolution.

Python 8,282 727 Updated Apr 10, 2026

[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 304 32 Updated Mar 31, 2026

A collection of awesome think with videos papers.

97 2 Updated Dec 1, 2025

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 1,456 235 Updated Mar 20, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,071 664 Updated Apr 11, 2026

Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication

Python 678 54 Updated Dec 2, 2025

(原创)全网最全-币圈区块链各类常用工具与相关信息资料大全-虚拟加密货币-欧易OKX币安Binace芝麻开门Gate-交易所App注册-NFT-Defi-加密钱包-比特币-新手入门教程 -持续更新

2,633 229 Updated Jan 31, 2026

A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding

Python 593 27 Updated Apr 11, 2026

A version of verl to support diverse tool use

Python 949 80 Updated Mar 2, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,751 296 Updated Apr 11, 2026

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 921 114 Updated Apr 9, 2026

[EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning

Python 36 Updated Oct 22, 2025

Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"

Python 48 4 Updated Jul 29, 2025

【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、LLM大模型、AI Agent、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、大数据挖掘、具身智能、元宇宙、AGI等AI行业面试笔试干货经验与核心知识。

3,445 380 Updated Apr 11, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 5,015 459 Updated Apr 11, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,500 2,350 Updated Mar 16, 2026

SoTA open-source TTS

Python 24,249 3,226 Updated Mar 26, 2026
Next