hongpeng-guo

Hongpeng Guo hongpeng-guo

Continuous Refactoring

84 followers · 106 following

ByteDance
San Jose
13:43 (UTC -07:00)
https://www.hongpeng-guo.com/

Achievements

x3 x2

Stars

jiahaoli57 / Call-for-Reviewers

This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

1,150 49 Updated Feb 6, 2026

Terra-Flux / PolyRL

[NSDI'26] PolyRL is a reinforcement learning framework for LLMs that harvests spot instances on the cloud to reduce cost.

Python 19 1 Updated Mar 30, 2026

coder / balatrobot

API for developing Balatro bots 🃏

Python 57 14 Updated Jun 15, 2026

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,435 295 Updated Jun 15, 2026

sail-sg / odc

On demand communication

Python 34 2 Updated Apr 16, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,867 79,270 Updated Jun 15, 2026

CalvinXKY / InfraTech

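Sharing AI Infra knowledge and code exercises: getting started with the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, large-model fundamentals 🧠, AI software and hardware 🔧, and more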

Jupyter Notebook 2,602 233 Updated May 30, 2026

nex-agi / NexRL

NexRL is an ultra-loosely-coupled LLM post-training framework.

Python 114 8 Updated May 13, 2026

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 12,047 1,360 Updated Jun 9, 2026

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 37,223 8,704 Updated Mar 26, 2026

inclusionAI / asystem-amem

An NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.

C++ 110 11 Updated Dec 17, 2025

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,403 700 Updated May 17, 2026

bytedance / SandboxFusion

Python 1,022 98 Updated May 13, 2026

verl-project / verl-recipe

A set of examples based on verl for end-to-end RL training recipes.

Python 291 134 Updated Jun 9, 2026

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 1,070 119 Updated Jun 12, 2026

thinking-machines-lab / tinker

Training API and CLI

Python 509 61 Updated May 31, 2026

NVlabs / Long-RL

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 724 28 Updated Sep 24, 2025

google-deepmind / rlax

Python 1,428 101 Updated Jun 12, 2026

radixark / miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,561 260 Updated Jun 15, 2026

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,233 289 Updated Jun 15, 2026

ai-dynamo / nixl

NVIDIA Inference Xfer Library (NIXL)

C++ 1,081 353 Updated Jun 15, 2026

thinking-machines-lab / tinker-project-ideas

Ideas for projects related to Tinker

177 10 Updated Nov 6, 2025

unslothai / unsloth

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,582 5,965 Updated Jun 15, 2026

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by the Moonshot AI team

10,857 853 Updated Jan 21, 2026

google / tunix

A Lightweight LLM Post-Training Library

Python 2,343 309 Updated Jun 15, 2026

excalidraw / excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 125,402 14,030 Updated Jun 15, 2026

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,579 851 Updated Jun 15, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 3,474 447 Updated Jun 15, 2026

thinking-machines-lab / batch_invariant_ops

Python 1,026 78 Updated Nov 4, 2025

MoonshotAI / checkpoint-engine

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 965 87 Updated Jun 8, 2026
