University of Washington - Seattle
https://zhangchenxu.com - @zhangchen_xu
Stars
Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code
Label Studio is a multi-type data labeling and annotation tool with standardized output format
A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.
All parts of Claude Code's system prompt, 20 built-in tool descriptions, sub-agent prompts (Plan/Explore/Task), utility prompts (CLAUDE.md, compact, statusline, magic docs, WebFetch, Bash cmd, secur…
https://astro-multiplepage-portfolio.edgeone.app/
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
GenAI Agent Framework, the Pydantic way
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).
Add your HDD, SSD and NVMe drives to your Synology's compatible drive database and a lot more
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
A Python tool that automatically cleans, completes, and standardizes BibTeX entries using LLMs and web search.
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
MCPToolBench++: a Model Context Protocol (MCP) tool-use benchmark evaluating the tool-use ability of AI agents and models
Synthetic data curation for post-training and structured data extraction
MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.
MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents
Public Evaluation Result Archive for BFCL
MCP-based Agent Deep Evaluation System
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!