-
Purdue University
- West Lafayette, IN, USA
- https://kaiyuanzhang.com
- @KaiyuanZh
Highlights
Stars
Source code for Cascading and Proxy Membership Inference Attacks. NDSS 2026.
[EMNLP 2025] Profiler: Black-box AI-generated Text Origin Detection via Context-aware Inference Pattern Analysis
[COLM 2025] Official implementation of μKE - edit LLM knowledge while preserving memory dependencies via Matryoshka-style objectives.
[USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks
A collection of papers related to steering of (multimodal) large language models.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Living Off The Land Binaries And Scripts - (LOLBins and LOLScripts)
[arXiv:2411.10023] "Model Inversion Attacks: A Survey of Approaches and Countermeasures"
An open protocol enabling communication and interoperability between opaque agentic applications.
Model Context Protocol Servers
No fortress, purely open ground. OpenManus is Coming.
DSPy: The framework for programming—not prompting—language models
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Tools for merging pretrained large language models.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
[ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
Official Implementation of NeurIPS 2024 paper - BiScope: AI-generated Text Detection by Checking Memorization of Preceding Tokens
Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"
[NDSS 2025] CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling