Skip to content
View aaasjp's full-sized avatar
  • deepglint
  • 22:19 (UTC -12:00)

Block or report aaasjp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

From Agent to Agency — 一个 AI 做不了的事,一群 AI 可以。Multi-Agent Collaboration Platform.

TypeScript 40 5 Updated Jun 16, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,453 492 Updated Jun 9, 2026

A self-hosted dashboard that puts all your feeds in one place

Go 35,217 1,366 Updated May 30, 2026

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python 673 59 Updated May 17, 2026

Nano vLLM

Python 14,087 2,231 Updated Apr 26, 2026

Playground for Transformers

Python 54 17 Updated Dec 16, 2023

Fully open reproduction of DeepSeek-R1

Python 26,327 2,444 Updated Apr 2, 2026

[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Python 391 24 Updated Mar 30, 2026

A agent framework based on the tutorial hello-agents

Python 2,077 513 Updated Jun 8, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,030 4,092 Updated Jun 18, 2026

A set of examples based on verl for end-to-end RL training recipes.

Python 295 138 Updated Jun 16, 2026

RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios

Python 555 56 Updated Jun 12, 2026

OpenClaw-RL: Train any agent simply by talking

Python 5,506 597 Updated May 23, 2026

Awesome List for Agentic RL

HTML 1,613 62 Updated May 26, 2026

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,976 1,104 Updated Apr 20, 2026

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 41,097 5,006 Updated Oct 10, 2025

✅(已完结)超级全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】

Jupyter Notebook 21,966 2,513 Updated Apr 27, 2026

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a querya…

Python 68,885 6,949 Updated Jun 18, 2026

Open Source Implementation of Karpathy's LLM Wiki. Upload documents, connect your Claude account via MCP, and have it write your wiki !

Python 1,146 184 Updated Jun 16, 2026

LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratc…

TypeScript 11,885 1,443 Updated Jun 18, 2026

analyse problems of AI with Math and Code

Jupyter Notebook 31 4 Updated Jul 28, 2025
Jupyter Notebook 393 32 Updated Sep 17, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 7,227 507 Updated Oct 30, 2025
Jupyter Notebook 1,393 199 Updated Dec 22, 2025

Curated list of datasets and tools for post-training.

4,653 385 Updated Apr 29, 2026

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 301 25 Updated Jan 17, 2026

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,908 1,382 Updated Apr 13, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 60,178 7,404 Updated Jun 11, 2026

Low-level unprivileged sandboxing tool used by Flatpak and similar projects

C 7,655 350 Updated Jun 2, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 71,462 9,695 Updated Jun 18, 2026
Next