Skip to content
View Benjamin-KY's full-sized avatar
:shipit:
Working and Learning
:shipit:
Working and Learning

Highlights

  • Pro

Organizations

@mlcommons

Block or report Benjamin-KY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Post-training with Tinker

Python 2,611 259 Updated Dec 24, 2025

Working Group on Artificial Intelligence and Machine Learning (AI/ML) Security

128 22 Updated Dec 19, 2025

HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.) autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, bug b…

Python 5,304 1,207 Updated Nov 6, 2025

Context retrieval for AI agents across apps and databases

Python 5,415 654 Updated Dec 24, 2025

The LLM Evaluation Framework

Python 12,719 1,124 Updated Dec 23, 2025

The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"

Shell 80 9 Updated Nov 9, 2024

Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misinformation", accepted by AI Magazine 2024

106 10 Updated Nov 9, 2024

AMITT (Adversarial Misinformation and Influence Tactics and Techniques) framework for describing disinformation incidents. Includes TTPs and countermeasures.

Jupyter Notebook 240 31 Updated Jul 3, 2022

Vanir is a source code-based static analysis tool that automatically identifies the list of missing security patches in the target system. By default, Vanir pulls up-to-date CVEs from Open Source V…

Python 343 30 Updated Oct 17, 2025

[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"

Python 181 21 Updated Apr 12, 2025

Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪

Python 1,720 223 Updated Dec 24, 2025

An adversarial example library for constructing attacks, building defenses, and benchmarking both

Jupyter Notebook 6,397 1,399 Updated Apr 10, 2024

A challenge to explore adversarial robustness of neural networks on MNIST.

Python 757 181 Updated May 3, 2022

🔓 🔓 Find secrets and passwords in container images and file systems 🔓 🔓

Go 3,251 342 Updated Dec 23, 2025

Make your GenAI Apps Safe & Secure 🚀 Test & harden your system prompt

Python 600 86 Updated Sep 23, 2025

Repo for Concierge AI dev work

Python 196 34 Updated Dec 16, 2025

the LLM vulnerability scanner

Python 6,668 736 Updated Dec 22, 2025

Flower: A Friendly Federated AI Framework

Python 6,529 1,121 Updated Dec 24, 2025

DEEPSEC: A Uniform Platform for Security Analysis of Deep Learning Model

Python 226 72 Updated May 21, 2019

💡 Adversarial attacks on explanations and how to defend them

330 48 Updated Nov 30, 2024

A curated list of awesome resources for adversarial examples in deep learning

265 56 Updated Feb 4, 2021

Adversarial attacks and defenses on Graph Neural Networks.

391 32 Updated Feb 22, 2024

Implementation of Papers on Adversarial Examples

Python 397 78 Updated Apr 24, 2023

🗣️ Tool to generate adversarial text examples and test machine learning models against them

Python 400 56 Updated Jan 7, 2022

Raising the Cost of Malicious AI-Powered Image Editing

Jupyter Notebook 641 57 Updated Feb 27, 2023

A pytorch adversarial library for attack and defense methods on images and graphs

Python 1,075 190 Updated Jun 26, 2025

A Toolbox for Adversarial Robustness Research

Jupyter Notebook 1,359 201 Updated Sep 14, 2023

Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow and Advbox can benchmark the robustness of machine learning mode…

Jupyter Notebook 1,407 266 Updated Feb 15, 2023

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

Python 2,931 435 Updated Dec 3, 2025

Master the fundamentals of machine learning, deep learning, and mathematical optimization by building key concepts and models from scratch using Python.

Python 1,860 630 Updated Dec 22, 2025
Next