llm-security

Star

Here are 38 public repositories matching this topic...

Giskard-AI / giskard

Sponsor

Star

🐢 Open-Source Evaluation & Testing for AI & LLM systems

Updated Feb 14, 2025
Python

NVIDIA / garak

Star

the LLM vulnerability scanner

ai vulnerability-assessment security-scanners llm-security llm-evaluation

Updated Feb 19, 2025
Python

protectai / llm-guard

Star

The Security Toolkit for LLM Interactions

transformers security-tools adversarial-machine-learning large-language-models llm prompt-engineering chatgpt llmops prompt-injection llm-security

Updated Jan 13, 2025
Python

msoedov / agentic_security

Star

Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪

agent-framework ai-red-team prompt-testing llm-security llm-vulnerabilities llm-evaluation llm-fuzzing llm-evaluation-framework llm-guardrails llm-scanner llm-jailbreaks llm-fuzzer llm-fuzzer-aggregator agent-security

Updated Feb 18, 2025
Python

EasyJailbreak / EasyJailbreak

Star

An easy-to-use Python framework to generate adversarial jailbreak prompts.

jailbreak discrete-optimization large-language-model llm-security llm-safety-benchmark jailbreak-framework

Updated Sep 2, 2024
Python

chawins / llm-sp

Star

Papers and resources related to the security and privacy of LLMs 🤖

security privacy awesome-list adversarial-machine-learning llm llm-security llm-privacy

Updated Nov 27, 2024
Python

deadbits / vigil-llm

Star

⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs

security-tools adversarial-machine-learning adversarial-attacks yara-scanner large-language-models llmops prompt-injection llm-security

Updated Jan 31, 2024
Python

liu00222 / Open-Prompt-Injection

Star

This repository provides implementation to formalize and benchmark Prompt Injection attacks and defenses

security-and-privacy llm llms prompt-injection llm-security prompt-injection-tool

Updated Jan 22, 2025
Python

ZenGuard-AI / fast-llm-security-guardrails

Star

The fastest && easiest LLM security guardrails for CX AI Agents and applications.

security llm-security llm-privacy prompt-security llm-guard llm-guardrails cx-agent

Updated Feb 14, 2025
Python

raga-ai-hub / raga-llm-hub

Star

Framework for LLM evaluation, guardrails and security

guardrails llmops llm-security llm-evaluation

Updated Sep 9, 2024
Python

arekusandr / last_layer

Star

Ultra-fast, low latency LLM prompt injection/jailbreak detection ⛓️

jailbreak security-tools large-language-models prompt-engineering chatgpt-prompts llm-security llm-local llm-guard llm-guardrails

Updated Jul 26, 2024
Python

RomiconEZ / llamator

Star

Framework for testing vulnerabilities of large language models (LLM).

Updated Feb 19, 2025
Python

llm-platform-security / SecGPT

Star

An Execution Isolation Architecture for LLM-Based Agentic Systems

sandbox gpt isolation multi-agent-systems openai-api llm chatgpt langchain llm-agent llm-security llm-framework llm-privacy llm-platform llm-based-systems

Updated Jan 31, 2025
Python

microsoft / BIPIA

Star

A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.

llm-security

Updated Apr 15, 2024
Python

Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to potentially execute offline remote code execution without running any actual code on the victim's machine or thwart LLM-based fraud/moderation systems.

backdoor-attacks llm-security qwen2-5