High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,976 1,104 Updated Apr 20, 2026

Lordog / dive-into-llms

"Dive into LLMs" (《动手学大模型》): a hands-on programming tutorial series

Jupyter Notebook 41,097 5,006 Updated Oct 10, 2025

AccumulateMore / CV

✅ (Completed) Highly comprehensive deep learning notes [Tudui PyTorch] [Mu Li, Dive into Deep Learning] [Andrew Ng, Deep Learning] [Dafei, LLM Agents]

Jupyter Notebook 21,966 2,513 Updated Apr 27, 2026

safishamsi / graphify

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a querya…

Python 68,885 6,949 Updated Jun 18, 2026

lucasastorian / llmwiki

Open-source implementation of Karpathy's LLM Wiki. Upload documents, connect your Claude account via MCP, and have it write your wiki!

Python 1,146 184 Updated Jun 16, 2026

nashsu / llm_wiki

LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratc…

TypeScript 11,885 1,443 Updated Jun 18, 2026

ifromeast / AI_analysis

Analyse problems in AI with math and code

Jupyter Notebook 31 4 Updated Jul 28, 2025

agentica-project / rllm

Jupyter Notebook 393 32 Updated Sep 17, 2025

InternLM / InternLM

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 7,227 507 Updated Oct 30, 2025

NousResearch / Hermes-Function-Calling

Jupyter Notebook 1,393 199 Updated Dec 22, 2025

mlabonne / llm-datasets

Curated list of datasets and tools for post-training.

4,653 385 Updated Apr 29, 2026

THUDM / AgentRL

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 301 25 Updated Jan 17, 2026

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,908 1,382 Updated Apr 13, 2026

datawhalechina / hello-agents

📚 "Building Agents from Scratch" (《从零开始构建智能体》): a from-scratch tutorial on agent principles and practice

Python 60,178 7,404 Updated Jun 11, 2026

containers / bubblewrap

Low-level unprivileged sandboxing tool used by Flatpak and similar projects

C 7,655 350 Updated Jun 2, 2026

bytedance / deer-flow

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 71,462 9,695 Updated Jun 18, 2026

aaasjp

Lists (7)

agentic-rl

ai-video

creative-products

edu-startup

🔮 Future ideas

llm-learning

train-llm-from-scratch

Starred repositories

langgraph-agents

langchain-agent

document-parser