Zilin Zhu zhuzilin

🛏️

Lying flat, lying flat......

☀️ RL infra @Z.ai, ex WeChat AI

1.9k followers · 165 following

Z.ai
Beijing
UTC +08:00
https://www.zhihu.com/people/zhu-xiao-lin-22-96

Achievements

x4 x2 x3

Starred repositories

obra / superpowers

An agentic skills framework & software development methodology that works.

Shell 227,457 20,225 Updated Jun 13, 2026

vllm-project / vime

An LLM post-training framework with vLLM for RL Scaling

Python 238 15 Updated Jun 14, 2026

Tencent-Hunyuan / UniRL

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 589 32 Updated Jun 14, 2026

awslabs / agentcore-rl-toolkit

Toolkit for Seamlessly Enabling RL Training on Any Agent with Bedrock AgentCore.

Python 43 4 Updated Jun 11, 2026

LMCache / LMCache

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

Python 9,023 1,312 Updated Jun 14, 2026

taco-project / FlexKV

Python 282 53 Updated Jun 9, 2026

THU-KEG / LongTraceRL

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Python 37 Updated Jun 1, 2026

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 9,969 892 Updated Jun 12, 2026

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 17,308 1,514 Updated Apr 29, 2026

NVIDIA-NeMo / ProRL-Agent-Server

Agentic RL on Any Harness at Scale

Python 554 57 Updated Jun 13, 2026

OpenBMB / ForgeTrain

Python 230 21 Updated May 26, 2026

restsend / pipa

A fast, minimal ES2023 JavaScript runtime built in Rust.

Rust 59 3 Updated Jun 13, 2026

StarTrail-org / LEANN

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 11,918 1,061 Updated Jun 9, 2026

tigerbeetle / tigerbeetle

The financial transactions database designed for mission critical safety and performance.

Zig 16,230 831 Updated Jun 13, 2026

farion1231 / cc-switch

A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io

Rust 100,439 6,631 Updated Jun 14, 2026

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 5,151 709 Updated Jun 13, 2026

Hacksore / cursed-repo

Average JavaScript repo 🫪

JavaScript 91 3 Updated May 15, 2026

slime-n / slime-n

A Multi-Policy, Multi-Agent RL Training Framework

Python 30 1 Updated Jun 2, 2026

toyaix / tritonllm

LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model

Python 118 6 Updated Apr 28, 2026

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,376 1,044 Updated Jun 4, 2026

IcyFish332 / T3RL

Python 47 5 Updated Apr 15, 2026

redai-infra / Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 423 45 Updated Jun 13, 2026

LMIS-ORG / slime-agentic

A project implementing various agentic RL based on the Slime post-training framework

Python 461 32 Updated Apr 11, 2026

inclusionAI / cuLA

CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.

Python 519 64 Updated Jun 12, 2026

sanbuphy / learn-coding-agent

Research on Coding Agents

11,994 19,703 Updated Apr 1, 2026

NousResearch / hermes-agent

The agent that grows with you

Python 193,178 33,748 Updated Jun 14, 2026

tiajinsha / JKVideo

A good-looking third-party Bilibili client built with React Native

TypeScript 4,989 2,920 Updated May 12, 2026

jackwener / OpenCLI

Make Any Website into CLI & Use your logged-in browser by AI agent.

JavaScript 24,319 2,434 Updated Jun 14, 2026

openai / codex

Lightweight coding agent that runs in your terminal

Rust 90,953 13,428 Updated Jun 14, 2026

stepfun-ai / SteptronOss

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 575 43 Updated May 18, 2026

WebRTC

path-tracing