- Microsoft
- Greater Seattle Area, WA, USA
- https://romanlutz.github.io
- in/romanlutz
- @romanlutz.bsky.social
Starred repositories
Open detection standard -- like Sigma, but for AI agents. 425 rules, shipped in Microsoft AGT, Cisco AI Defense, MISP, OWASP A-S-R-H. 97.1% recall on NVIDIA garak. NIST OSCAL Path 1.
Collection of evals for Inspect AI
Inspect: A framework for large language model evaluations
Repository for "StrongREJECT for Empty Jailbreaks" paper
Official repo for GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
A database migrations tool for SQLAlchemy.
A pytest-native safety and security testing framework for agentic AI applications
Repository for "Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models"
A tool that validates academic paper references
An extremely fast Python type checker and language server, written in Rust.
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments
Benchmarking LLM agents on Cyber Threat Investigation.
Simple Prompt Injection Kit for Evaluation and Exploitation
Recursively scan a Python module and export numpydoc docstrings to JSON
Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
A simple screen parsing tool towards pure vision based GUI agent
AI Red Teaming playground labs to run AI Red Teaming trainings including infrastructure.
Gather metrics on issues/prs/discussions such as time to first response, count of issues opened, closed, etc.
A Text-Based Environment for Interactive Debugging