jianzhu


An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 58,297 7,283 Updated Apr 6, 2026

The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 171,302 103,854 Updated Apr 6, 2026

Meta-Harness: 76.4% on Terminal-Bench 2.0 (Claude Opus 4.6)

Python 629 95 Updated Mar 26, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,670 476 Updated Apr 5, 2026

Lightweight coding agent that runs in your terminal

Rust 73,338 10,305 Updated Apr 6, 2026

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

TypeScript 48,669 7,853 Updated Apr 1, 2026

AI agents running research on single-GPU nanochat training automatically

Python 66,565 9,546 Updated Mar 26, 2026

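Latest 2026 ChatGPT top-up and subscription guide (117 yuan/month): this article focuses on five ways to subscribe to ChatGPT Plus: buying a standalone ChatGPT Plus account, having someone top up your ChatGPT account on your behalf, sharing a ChatGPT Plus account with others, paying for a ChatGPT membership with an Apple gift card, and subscribing to ChatGPT Plus with a foreign virtual credit card.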

CSS 897 34 Updated Mar 18, 2026

"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"

Python 38,073 6,615 Updated Apr 5, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 726 79 Updated Feb 18, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 349,150 69,932 Updated Apr 6, 2026

A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode, openclaw & Gemini CLI.

Rust 39,354 2,461 Updated Apr 5, 2026

The best ChatGPT that $100 can buy.

Python 51,136 6,745 Updated Mar 27, 2026

Self-Adapting Language Models

Python 1,735 304 Updated Aug 1, 2025

Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature convergence and unlock greater RL potential.

Python 30 2 Updated Oct 10, 2025

A minimal implementation of DeepMind's Genie world model

Python 1,195 98 Updated Feb 28, 2026

Mobile-Agent: The Powerful GUI Agent Family

Python 8,386 847 Updated Mar 31, 2026

Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"

Python 128 9 Updated Feb 4, 2026

Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay

Python 153 10 Updated May 29, 2025

✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 283 22 Updated May 9, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,174 77 Updated Jul 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,457 3,576 Updated Apr 3, 2026

Fully open reproduction of DeepSeek-R1

Python 25,965 2,410 Updated Apr 2, 2026

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,240 58 Updated Aug 27, 2025

PyTorch implementation of AWR.

Python 4 1 Updated Apr 29, 2020

[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training

Python 261 25 Updated Aug 9, 2025
Python 119 9 Updated Apr 8, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,974 286 Updated May 15, 2025

s1: Simple test-time scaling

Python 6,644 764 Updated Jun 25, 2025