-
The University of Chicago
- Chicago
-
06:08
(UTC -05:00) - www.chengquanguo.com
- in/chengquan-guo-ba57782b9
- @ChengquanGuo
Stars
This is the official code for "Black-box Optimization of LLM Outputs by Asking for Directions"
TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning
[ICLR 2026] Official implementation for "RedCodeAgent: Automatic Red-teaming Agent against Diverse Code Agents"
[NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Source codes for paper "MACRec: A Multi-Agent Collaboration Framework for Recommendation" at SIGIR 2024
Yunjue Agent: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks
Create beautiful slides on the web using a coding agent's frontend skills
Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.
An Open Foundation Model and Benchmark to Accelerate Generative Recommendation
[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
This repository provides a benchmark for prompt injection attacks and defenses in LLMs
[ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents
Agentic AI research papers, benchmarks, frameworks, and tools curated across 24 domains.
Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
A modern cookiecutter template for deep learning projects with pytorch lightning that use uv for dependency management
Build and deploy stateful agents across federated resources
Evaluating Agent Safety in Realistic, High-Risk Simulations
A benchmark for LLMs on complicated tasks in the terminal
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)
🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025