Skip to content
View xxyQwQ's full-sized avatar

Highlights

  • Pro

Block or report xxyQwQ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation for the paper "StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction".

Python 37 8 Updated May 8, 2026

AI handles execution, humans own the direction, and every run becomes an inspectable research artifact on disk.

Python 858 23 Updated Jun 15, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 194,162 109,907 Updated Jun 8, 2026

LatentMem: Customizing Latent Memory for Multi-Agent Systems

Python 47 8 Updated Feb 9, 2026

🦄️ 🎃 👻 Clash Premium 规则集(RULE-SET),兼容 ClashX Pro、Clash for Windows 等基于 Clash Premium 内核的客户端。

27,242 2,158 Updated Jun 21, 2026

分流规则、重写写规则及脚本。

JavaScript 26,885 3,997 Updated Jun 21, 2026

Elevate your AI research writing, no more tedious polishing ✨

29,214 2,252 Updated May 18, 2026

⏰ Agenticly track worldwide conference deadlines (Website, Python Cli, Wechat Applet)

Rust 9,101 607 Updated Jun 20, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 2,037 201 Updated Jun 9, 2026

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Python 165 4 Updated Jun 2, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,640 576 Updated Jun 22, 2026
Python 241 28 Updated Jul 25, 2025

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 526 47 Updated Apr 14, 2026

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.

Python 4,831 585 Updated Jun 12, 2026

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 3,019 165 Updated Jul 9, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,671 972 Updated Jun 17, 2026

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,694 133 Updated Nov 21, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 142,627 20,510 Updated Jun 22, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,381 8,855 Updated Jun 22, 2026

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 412 39 Updated Dec 15, 2024

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 803 114 Updated May 30, 2026

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

8,155 495 Updated Sep 12, 2025

Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".

Python 203 10 Updated Dec 24, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 5,010 1,122 Updated Sep 4, 2025

A repo lists papers related to LLM based agent

Python 2,320 151 Updated Jul 12, 2025

Must-read Papers on LLM Agents.

3,056 183 Updated Jun 18, 2026

😎 Awesome lists about all kinds of interesting topics

477,948 35,503 Updated Jun 2, 2026

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 117,936 13,783 Updated Jun 22, 2026
Next