-
PolyU
- Hong Kong, China
- https://hemingkx.github.io/
- @hemingkx
Highlights
- Pro
Stars
"Optical Reasoning: Rethinking Images as an Expressive Reasoning Medium Beyond Text"
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Official repository for textual frequency law
agent multiplexer that lives in your terminal.
Browser Harness | Self-healing harness that enables LLMs to complete any task.
Official style files for papers submitted to venues of the Association for Computational Linguistics
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
A project implementing various agentic RL based on the Slime post-training framework
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.
Awesome list for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.
The agent that grows with you
🛠️ Awesome tools & guides for harness engineering.
SkillsBench evaluates how well skills work and how effective agents are at using them
Memento-Skills: Let Agents Design Agents
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Specification and documentation for Agent Skills
AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution
contains the list of papers of agent skills
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
General technology for enabling AI capabilities w/ LLMs and MLLMs
Your private AI assistant on your phone: simple, safe, and ready anytime. 你手机里的私人 AI 助手:简单、安全,随时可用。