Skip to content
View Acerkoo's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Beijing Jiaotong University

Block or report Acerkoo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A project implementing various agentic RL based on the Slime post-training framework

Python 251 6 Updated Apr 3, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 939 82 Updated Feb 28, 2026

原汁原昧 Claude Code 可运行,可构建, 可调试版; Typescript 类型全修复; 企业级可靠性; 安全无毒, lock 文件保真, 可直接 bun i; bun run dev 启动

TypeScript 15,120 14,763 Updated Apr 9, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 59,918 7,596 Updated Apr 9, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,778 494 Updated Apr 8, 2026

The agent engineering platform

Python 132,971 21,937 Updated Apr 9, 2026

Kimi Agent SDK provides a programmatic interface to interact with the Kimi CLI

TypeScript 386 51 Updated Apr 3, 2026
Python 3 Updated Mar 18, 2026

LLM Architecture Gallery source data

1,015 78 Updated Apr 4, 2026

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 29,598 2,924 Updated Mar 27, 2026

📚 从零开始构建大模型

Jupyter Notebook 28,590 2,653 Updated Mar 16, 2026

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

Python 4,426 330 Updated Sep 2, 2025

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 15,676 882 Updated Mar 31, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 34,899 4,075 Updated Mar 30, 2026

HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing

Python 273 12 Updated Mar 18, 2026

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 78 8 Updated Apr 9, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 353,292 71,305 Updated Apr 9, 2026
Python 271 11 Updated May 14, 2025

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 942 246 Updated Apr 9, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,062 265 Updated Apr 9, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,947 557 Updated Mar 13, 2026

Step-DeepResearch

Python 540 21 Updated Mar 24, 2026

Nano vLLM

Python 12,773 1,896 Updated Nov 3, 2025

Accelerating MoE with IO and Tile-aware Optimizations

Python 625 68 Updated Apr 1, 2026

Machine Learning Engineering Open Book

Python 17,651 1,119 Updated Mar 16, 2026

The Art of Debugging Open Book

Python 1,341 68 Updated Apr 8, 2026

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

Python 60,686 5,214 Updated Apr 9, 2026

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 24,788 3,904 Updated Mar 6, 2026
Next