Skip to content
View Acerkoo's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Beijing Jiaotong University

Block or report Acerkoo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 1 Updated Apr 15, 2026

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 257 21 Updated Apr 17, 2026

The agent that grows with you

Python 96,963 13,633 Updated Apr 18, 2026

A project implementing various agentic RL based on the Slime post-training framework

Python 335 18 Updated Apr 11, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 941 82 Updated Feb 28, 2026

原汁原昧 Claude Code 可运行,可构建, 可调试版; Typescript 类型全修复; 企业级可靠性; 安全无毒, lock 文件保真, 可直接 bun i; bun run dev 启动

TypeScript 16,167 15,178 Updated Apr 17, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 62,337 8,051 Updated Apr 18, 2026

OpenClaw-RL: Train any agent simply by talking

Python 5,020 530 Updated Apr 16, 2026

The agent engineering platform

Python 133,891 22,127 Updated Apr 17, 2026

Kimi Agent SDK provides a programmatic interface to interact with the Kimi CLI

TypeScript 398 51 Updated Apr 17, 2026
Python 3 Updated Mar 18, 2026

LLM Architecture Gallery source data

1,041 80 Updated Apr 4, 2026

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 29,861 2,948 Updated Apr 13, 2026

📚 从零开始构建大模型

Jupyter Notebook 29,019 2,693 Updated Mar 16, 2026

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

Python 4,437 332 Updated Sep 2, 2025

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 16,328 936 Updated Apr 17, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 37,985 4,506 Updated Apr 17, 2026

HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing

Python 276 13 Updated Mar 18, 2026

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 92 8 Updated Apr 15, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 359,537 73,184 Updated Apr 18, 2026
Python 271 11 Updated May 14, 2025

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

C++ 943 248 Updated Apr 16, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,088 272 Updated Apr 18, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,015 583 Updated Mar 13, 2026

Step-DeepResearch

Python 544 23 Updated Mar 24, 2026

Nano vLLM

Python 12,970 1,945 Updated Apr 13, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 635 73 Updated Apr 17, 2026

Machine Learning Engineering Open Book

Python 17,720 1,124 Updated Mar 16, 2026
Next