- The Chinese University of Hong Kong
- Hong Kong SAR, China
- https://xxyqwq.cn/
Stars
Implementation for the paper "StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction".
AI handles execution, humans own the direction, and every run becomes an inspectable research artifact on disk.
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
LatentMem: Customizing Latent Memory for Multi-Agent Systems
🦄️ 🎃 👻 Clash Premium rule sets (RULE-SET), compatible with ClashX Pro, Clash for Windows, and other clients based on the Clash Premium core.
Elevate your AI research writing, no more tedious polishing ✨
⏰ Agentically track worldwide conference deadlines (Website, Python CLI, WeChat Applet)
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
A very simple GRPO implementation for reproducing R1-like LLM thinking.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
A repo that lists papers related to LLM-based agents
😎 Awesome lists about all kinds of interesting topics
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.