🍞
Baking...

Highlights

  • Pro

Organizations

@uw-nsl @magpie-align

Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code

TypeScript 38,996 2,838 Updated Dec 19, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 25,884 3,268 Updated Dec 19, 2025

A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.

Python 925 124 Updated Dec 11, 2025

All parts of Claude Code's system prompt, 20 builtin tool descriptions, sub agent prompts (Plan/Explore/Task), utility prompts (CLAUDE.md, compact, statusline, magic docs, WebFetch, Bash cmd, secur…

JavaScript 1,702 256 Updated Dec 19, 2025

https://astro-multiplepage-portfolio.edgeone.app/

Astro 5 1 Updated Dec 8, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 163,754 52,311 Updated Dec 20, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 330 77 Updated Oct 29, 2025

Kimi CLI is your next CLI agent.

Python 3,641 359 Updated Dec 19, 2025

GenAI Agent Framework, the Pydantic way

Python 13,885 1,489 Updated Dec 20, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,123 877 Updated Dec 20, 2025
Python 32 1 Updated Dec 1, 2025

Add your HDD, SSD and NVMe drives to your Synology's compatible drive database and a lot more

Shell 5,041 331 Updated Dec 18, 2025

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Python 35 1 Updated Oct 26, 2025

On-device TTS model by Neuphonic

Python 4,273 448 Updated Dec 15, 2025

Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale

Python 5,537 512 Updated Dec 19, 2025

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 32,669 5,077 Updated Dec 20, 2025

Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 188 7 Updated Dec 16, 2025

A Python tool that automatically cleans, completes, and standardizes BibTeX entries using LLMs and web search.

Python 161 6 Updated Dec 4, 2025

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 713 121 Updated Dec 14, 2025

MCPToolBench++ MCP Model Context Protocol Tool Use Benchmark on AI Agent and Model Tool Use Ability

Python 37 5 Updated Dec 17, 2025

A collection of LLM memes

343 4 Updated Sep 22, 2025

Synthetic data curation for post-training and structured data extraction

Python 1,581 126 Updated Jul 29, 2025

MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.

Python 352 25 Updated Dec 13, 2025

MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents

Python 525 64 Updated Dec 9, 2025

Public Evaluation Result Archive for BFCL

Python 23 2 Updated Dec 17, 2025

Code and Data for Tau-Bench

Python 1,021 160 Updated Aug 28, 2025

MCP-based Agent Deep Evaluation System

Python 139 16 Updated Sep 26, 2025
TypeScript 27,792 2,202 Updated Aug 7, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,752 1,070 Updated Dec 20, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,048 643 Updated Dec 19, 2025