Skip to content
View romanlutz's full-sized avatar

Organizations

@fairlearn

Block or report romanlutz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AI model safety scanner built on NVIDIA garak

Ruby 561 90 Updated Jun 18, 2026
Jupyter Notebook 29 8 Updated Oct 22, 2024

Open Machine Learning

PHP 743 124 Updated Jun 17, 2026

Open detection standard -- like Sigma, but for AI agents. 425 rules, shipped in Microsoft AGT, Cisco AI Defense, MISP, OWASP A-S-R-H. 97.1% recall on NVIDIA garak. NIST OSCAL Path 1.

TypeScript 258 33 Updated Jun 19, 2026

Collection of evals for Inspect AI

Python 546 357 Updated Jun 19, 2026

Inspect: A framework for large language model evaluations

Python 2,224 567 Updated Jun 18, 2026

Repository for "StrongREJECT for Empty Jailbreaks" paper

Jupyter Notebook 158 7 Updated Nov 3, 2024

Official repo for GPTFUZZER : Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts

Python 588 84 Updated Feb 27, 2026

A database migrations tool for SQLAlchemy.

Python 4,209 341 Updated May 31, 2026

A pytest-native safety and security testing framework for agentic AI applications

Python 360 42 Updated Jun 18, 2026

Repository for "Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models"

Python 2 Updated Apr 27, 2026

Squad: AI agent teams for any project

TypeScript 2,832 426 Updated Jun 18, 2026

A tool that validates academic paper references

Python 406 48 Updated Jun 13, 2026

Gas Town - multi-agent workspace manager

Go 15,972 1,487 Updated Jun 17, 2026

An extremely fast Python type checker and language server, written in Rust.

Python 18,986 306 Updated Jun 19, 2026

This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…

Python 5,557 3,313 Updated Jun 19, 2026

Playwright MCP server

TypeScript 34,105 2,825 Updated Jun 10, 2026

[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

Python 55 11 Updated Feb 9, 2026

Benchmarking LLM agents on Cyber Threat Investigation.

Jupyter Notebook 128 23 Updated May 4, 2026

Simple Prompt Injection Kit for Evaluation and Exploitation

Python 200 43 Updated Jun 12, 2026

Recursively scan a Python module and export numpydoc docstrings to JSON

TypeScript 4 1 Updated May 28, 2026

Open-Source Apprenticeship Program

3 Updated May 22, 2025

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python 125 12 Updated Dec 2, 2024
Python 10 3 Updated Jun 16, 2026

File support for asyncio

Python 3,249 166 Updated Oct 9, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,917 2,186 Updated Apr 13, 2026

AI Red Teaming playground labs to run AI Red Teaming trainings including infrastructure.

TypeScript 1,977 296 Updated Feb 13, 2026

Gather metrics on issues/prs/discussions such as time to first response, count of issues opened, closed, etc.

Python 532 90 Updated Jun 18, 2026

A Text-Based Environment for Interactive Debugging

Python 298 40 Updated Jun 18, 2026
Jupyter Notebook 3 Updated Feb 25, 2025
Next