
This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

1,150 49 Updated Feb 6, 2026

[NSDI'26] PolyRL is a reinforcement learning framework for LLMs that harvests spot instances on the cloud to reduce cost.

Python 19 1 Updated Mar 30, 2026

API for developing Balatro bots 🃏

Python 57 14 Updated Jun 15, 2026

Tile primitives for speedy kernels

Cuda 3,435 295 Updated Jun 15, 2026

On demand communication

Python 34 2 Updated Apr 16, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,867 79,270 Updated Jun 15, 2026

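Sharing AI Infra knowledge & code exercises: getting started with the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, large model fundamentals 🧠, AI software and hardware 🔧, and more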

Jupyter Notebook 2,602 233 Updated May 30, 2026

NexRL is an ultra-loosely-coupled LLM post-training framework.

Python 114 8 Updated May 13, 2026

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 12,047 1,360 Updated Jun 9, 2026

A toolkit for developing and comparing reinforcement learning algorithms.

Python 37,223 8,704 Updated Mar 26, 2026

An NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.

C++ 110 11 Updated Dec 17, 2025

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,403 700 Updated May 17, 2026
Python 1,022 98 Updated May 13, 2026

A set of examples based on verl for end-to-end RL training recipes.

Python 291 134 Updated Jun 9, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 1,070 119 Updated Jun 12, 2026

Training API and CLI

Python 509 61 Updated May 31, 2026

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 724 28 Updated Sep 24, 2025
Python 1,428 101 Updated Jun 12, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,561 260 Updated Jun 15, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,233 289 Updated Jun 15, 2026

NVIDIA Inference Xfer Library (NIXL)

C++ 1,081 353 Updated Jun 15, 2026

Ideas for projects related to Tinker

177 10 Updated Nov 6, 2025

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,582 5,965 Updated Jun 15, 2026

Kimi K2 is the large language model series developed by the Moonshot AI team

10,857 853 Updated Jan 21, 2026

A Lightweight LLM Post-Training Library

Python 2,343 309 Updated Jun 15, 2026

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 125,402 14,030 Updated Jun 15, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,579 851 Updated Jun 15, 2026

Post-training with Tinker

Python 3,474 447 Updated Jun 15, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 965 87 Updated Jun 8, 2026