Abbey4799

TypeScript 8,989 787 Updated Jun 15, 2026

The first Chinese metaphor corpus for metaphor identification and generation (a Chinese metaphor dataset). Presented at COLING 2022.

Python 48 3 Updated Jan 25, 2023

🦞 ClawMark: A Living-World Benchmark for Multi-Day, Multimodal Coworker Agents

Python 110 9 Updated May 28, 2026

A persistent, unified memory layer for all your AI agents (e.g. Claude Code, Codex), backed by Markdown and Milvus.

Python 2,034 184 Updated Jun 15, 2026

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

Python 441 41 Updated May 19, 2026

An in-depth study of the Claude Code source code, organized into three main chapters (Foundations, Execution, Infrastructure) with an architectural breakdown of 23 subsystems.

JavaScript 33 3 Updated May 27, 2026

Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with C…

JavaScript 82,527 7,139 Updated Jun 15, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,876 79,271 Updated Jun 15, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,846 109,960 Updated Jun 8, 2026

Claw-Eval is a harness for evaluating LLMs as agents. All tasks are verified by humans.

Python 667 57 Updated May 17, 2026

Mini CLI search engine for your docs, knowledge bases, meeting notes, whatever. Tracks current SOTA approaches while running entirely locally.

TypeScript 26,591 1,665 Updated Jun 8, 2026

Universal memory layer for AI Agents

Python 58,633 6,738 Updated Jun 15, 2026

Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.

Python 45 3 Updated Jul 3, 2025

[ICML 26] An evaluation framework assessing long-context retention and long-horizon memory performance for agentic applications (AMA-bench).

Python 51 10 Updated Jun 15, 2026

Open-source code for the ICLR 2026 paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Python 366 55 Updated May 21, 2026

Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)

Python 873 70 Updated May 11, 2026

Reverse-engineered TypeScript client for QClaw's WeChat Access API.

TypeScript 801 380 Updated Mar 22, 2026

CL-bench: A Benchmark for Context Learning

Python 559 29 Updated May 12, 2026

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 801 113 Updated May 30, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 19,409 1,489 Updated Feb 27, 2026

Official Implementation of Knowledge Flow Prompting

Python 35 2 Updated Oct 20, 2025

Ranking signals for Wikidata

Go 87 5 Updated Mar 30, 2026

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 526 47 Updated Apr 14, 2026

DeepConf: Deep Think with Confidence

Python 402 59 Updated Jun 10, 2026

Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"

Python 199 13 Updated Apr 2, 2026

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 312 46 Updated Jun 5, 2026

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 980 142 Updated Aug 16, 2024

Sotopia-RL: Reward Design for Social Intelligence

Python 51 9 Updated Apr 1, 2026

The absolute trainer to light up AI agents.

Python 17,313 1,515 Updated Apr 29, 2026