hhaAndroid

Follow

Haian Huang(深度眸) hhaAndroid

Follow

LLM&MLLM Infra, RL

724 followers · 44 following

nuaa
上海
12:03 (UTC -12:00)

Achievements

Achievements

Organizations

Stars

QwenLM / FlashQLA

high-performance linear attention kernel library built on TileLang

Python 363 26 Updated Apr 30, 2026

alibaba / OpenSandbox

Secure, Fast, and Extensible Sandbox runtime for AI agents.

Python 10,383 824 Updated Apr 29, 2026

redai-infra / Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 341 32 Updated Apr 30, 2026

hellock / agent-harness-eval

Python 8 Updated Apr 17, 2026

NousResearch / hermes-agent

The agent that grows with you

Python 126,740 18,980 Updated May 1, 2026

rasbt / mini-coding-agent

Minimal and readable coding agent harness implementation in Python to explain the core components of coding agents.

Python 783 148 Updated Apr 7, 2026

LMIS-ORG / slime-agentic

A project implementing various agentic RL based on the Slime post-training framework

Python 369 19 Updated Apr 11, 2026

Tencent-Hunyuan / CL-bench

CL-bench: A Benchmark for Context Learning

Python 524 29 Updated Apr 30, 2026

claw-eval / claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python 510 43 Updated Apr 29, 2026

pinchbench / skill

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

Python 1,084 121 Updated Apr 30, 2026

BerriAI / litellm

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 45,351 7,690 Updated May 1, 2026

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 4,132 568 Updated Apr 27, 2026

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 19,111 2,061 Updated Apr 27, 2026

sierra-research / tau2-bench

τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Python 1,097 279 Updated Apr 30, 2026

harbor-framework / harbor

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 1,735 971 Updated Apr 30, 2026

harbor-framework / terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python 2,119 507 Updated Jan 22, 2026

rlops / rlix

Run more RL experiments. Wait less for GPUs.

Python 276 17 Updated Apr 24, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 5,186 554 Updated Apr 30, 2026

VoltAgent / awesome-openclaw-skills

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

47,651 4,668 Updated Apr 20, 2026

xianyu110 / awesome-openclaw-tutorial

从零开始玩转OpenClaw：最全面的中文教程，涵盖安装、配置、实战案例和避坑指南（github版）

Shell 4,316 632 Updated Apr 16, 2026

QwenLM / Qwen3.6

Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.

3,228 207 Updated Apr 22, 2026

InternLM / InternBootcamp

Python 345 25 Updated Aug 29, 2025

vndee / llm-sandbox

Lightweight and portable LLM sandbox runtime (code interpreter) Python library.

Python 1,048 99 Updated Apr 20, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 366,792 75,342 Updated Apr 30, 2026

nex-agi / NexRL

NexRL is an ultra-loosely-coupled LLM post-training framework.

Python 104 6 Updated Apr 27, 2026

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 2,233 155 Updated Nov 4, 2025

vipshop / cache-dit

A PyTorch-native inference engine with cache, parallelism, quantization for Diffusion Transformers.

Python 1,156 70 Updated Apr 29, 2026

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,834 490 Updated Feb 10, 2026

strands-rl / strands-sglang

SGLang model provider for Strands Agents for on-policy agentic RL training.

Python 52 8 Updated Apr 22, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,084 603 Updated Mar 13, 2026