Skip to content
View hemingkx's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@polyunlp

Block or report hemingkx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 1,040 72 Updated May 12, 2026

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Jupyter Notebook 249 24 Updated Sep 2, 2025

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript 19,187 1,905 Updated Feb 2, 2026

Awesome list for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.

Python 969 87 Updated May 17, 2026

The agent that grows with you

Python 154,915 24,822 Updated May 18, 2026

🛠️ Awesome tools & guides for harness engineering.

2,498 190 Updated May 12, 2026

SkillsBench evaluates how well skills work and how effective agents are at using them

PDDL 1,176 295 Updated May 16, 2026

Memento-Skills: Let Agents Design Agents

Python 1,378 155 Updated Apr 24, 2026

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Python 738 56 Updated May 17, 2026

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 43,190 4,898 Updated May 14, 2026

Specification and documentation for Agent Skills

Python 18,778 1,142 Updated Apr 22, 2026

AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution

Python 415 46 Updated May 10, 2026

contains the list of papers of agent skills

239 2 Updated Mar 19, 2026

A paper list of Awesome Latent Space.

855 34 Updated Apr 30, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 124,406 20,481 Updated May 18, 2026

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,384 373 Updated May 12, 2026

Your private AI assistant on your phone: simple, safe, and ready anytime. 你手机里的私人 AI 助手:简单、安全,随时可用。

Kotlin 1,082 94 Updated Apr 30, 2026

The open source coding agent.

TypeScript 161,734 19,012 Updated May 18, 2026

"Parallel Test-Time Scaling for Latent Reasoning Models"

Python 21 2 Updated Apr 12, 2026

AI agents running research on single-GPU nanochat training automatically

Python 81,556 11,857 Updated Mar 26, 2026

DFlash: Block Diffusion for Flash Speculative Decoding

Python 4,622 329 Updated May 10, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 372,711 77,269 Updated May 18, 2026

OpenClaw-RL: Train any agent simply by talking

Python 5,331 580 Updated May 12, 2026

"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"

Python 42,662 7,506 Updated May 17, 2026

The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"

Python 33 Updated Mar 26, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 876 95 Updated Feb 18, 2026

CL-bench: A Benchmark for Context Learning

Python 547 30 Updated May 12, 2026

[SIGIR 2026] "One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment"

Python 15 2 Updated Apr 21, 2026

"Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space"

Python 14 4 Updated Jan 21, 2026
Next