Abbey4799

Follow

Qianyu He Abbey4799

Follow

Ph.D. candidate in CS, Fudan University

84 followers · 34 following

Fudan University
Shanghai
https://abbey4799.github.io/

Achievements

Achievements

Stars

XiaomiMiMo / MiMo-Code

TypeScript 8,989 787 Updated Jun 15, 2026

liyucheng09 / Metaphor_Generator

The first Chinese metaphor corpus serving for identification and generation. 中文比喻数据集. Presented at COLING 2022.

Python 48 3 Updated Jan 25, 2023

evolvent-ai / ClawMark

🦞 ClawMark: A Living-World Benchmark for Multi-Day, Multimodal Coworker Agents

Python 110 9 Updated May 28, 2026

zilliztech / memsearch

A persistent, unified memory layer for all your AI agents (e.g. Claude Code, Codex), backed by Markdown and Milvus.

Python 2,034 184 Updated Jun 15, 2026

InternLM / WildClawBench

An in-the-wild benchmark for AI agents in the OpenClaw Environment.

Python 441 41 Updated May 19, 2026

xiaonancs / claude-code-source-analysis

Claude Code 源码深度研究，包括 Foundations/Execution/Infrastructure 三大章节和 23 个子系统的架构分析拆解。

JavaScript 33 3 Updated May 27, 2026

thedotmack / claude-mem

Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with C…

JavaScript 82,527 7,139 Updated Jun 15, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,876 79,271 Updated Jun 15, 2026

ultraworkers / claw-code

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,846 109,960 Updated Jun 8, 2026

claw-eval / claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python 667 57 Updated May 17, 2026

tobi / qmd

mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local

TypeScript 26,591 1,665 Updated Jun 8, 2026

mem0ai / mem0

Universal memory layer for AI Agents

Python 58,633 6,738 Updated Jun 15, 2026

danny911kr / REALTALK

Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.

Python 45 3 Updated Jul 3, 2025

AMA-Bench / AMA-Bench

[ICML 26] An evaluation framework assessing long-context retention and long-horizon memory performance for agentic applications (AMA-bench).

Python 51 10 Updated Jun 15, 2026

HUST-AI-HYZ / MemoryAgentBench

Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Python 366 55 Updated May 21, 2026

xiaowu0162 / LongMemEval

Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)

Python 873 70 Updated May 11, 2026

photon-hq / qclaw-wechat-client

Reverse-engineered TypeScript client for QClaw's WeChat Access API.

TypeScript 801 380 Updated Mar 22, 2026

Tencent-Hunyuan / CL-bench

CL-bench: A Benchmark for Context Learning

Python 559 29 Updated May 12, 2026

WooooDyy / AgentGym

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 801 113 Updated May 30, 2026

apple / ml-entity-deduction-arena

Python 41 6 Updated May 31, 2024

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 19,409 1,489 Updated Feb 27, 2026

EvanZhuang / knowledge_flow

Official Implementation of Knowledge Flow Prompting

Python 35 2 Updated Oct 20, 2025

brawer / wikidata-qrank

Ranking signals for Wikidata

Go 87 5 Updated Mar 30, 2026

TsinghuaC3I / MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 526 47 Updated Apr 14, 2026

facebookresearch / deepconf

DeepConf: Deep Think with Confidence

Python 402 59 Updated Jun 10, 2026

Neph0s / CoSER

Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"

Python 199 13 Updated Apr 2, 2026

sotopia-lab / sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 312 46 Updated Jun 5, 2026

centerforaisafety / HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 980 142 Updated Aug 16, 2024

sotopia-lab / sotopia-rl

Sotopia-RL: Reward Design for Social Intelligence

Python 51 9 Updated Apr 1, 2026

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 17,313 1,515 Updated Apr 29, 2026