llm-security

Here are 99 public repositories matching this topic...

recetariodmix / garak

🔍 Discover vulnerabilities in LLMs with garak, a tool that probes for weaknesses like hallucination, data leakage, and misinformation effectively.

ai vulnerability-assessment redteaming github-config responsible-ai security-scanners llmops llm-security llm-evaluation garak

Updated Dec 14, 2025
Python

snowz123 / team-agents

Star

🐙 Team Agents unifica 82 especialistas en IA para resolver desafíos con chat inteligente, analista de requisitos y subida de documentos. Plataforma futurista y modular.

agent azure datasources agents multi-agents llm generative-ai chatgpt langchain prompt-testing llma-index llm-security llm-vulnerabilities agent-simulations llm-evaluation llm-evaluation-framework llm-guardrails llm-scanner

Updated Dec 14, 2025
Python

NVIDIA-NeMo / Guardrails

Star

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

python nvidia safety agents guardrails llms generative-ai llm-security llm-safety

Updated Dec 14, 2025
Python

openguardrails / openguardrails

Star

The first open-source, customizable AI guardrails with user-defined scanners and custom model training support. It protects the entire AI inference pipeline—including prompts, models, agents, and outputs. Redefining runtime AI security for enterprise AI-powered applications.

jailbreaking guardrails prompt-injection llm-security data-leakage-prevention llm-safety content-safety

Updated Dec 13, 2025
Python

sookoothaii / llm-security-firewall

Star

Bidirectional LLM security firewall providing risk reduction (not complete protection) for human/LLM interfaces. Hexagonal architecture with multi-layer validation of inputs, outputs, memory and tool state. Beta status. ~528 KB wheel, optional ML guards.

Updated Dec 13, 2025
Python

NVIDIA / garak

Star

the LLM vulnerability scanner

ai vulnerability-assessment security-scanners llm-security llm-evaluation

Updated Dec 12, 2025
Python

musabdulai-io / ai-security-scanner

Star

Security scanner for LLM/RAG applications - Test for prompt injection, jailbreaks, PII leakage, hallucinations & more

cli security jailbreak owasp penetration-testing red-team ai-safety vulnerability-scanner ai-security rag fastapi pii-detection llm prompt-injection llm-security

Updated Dec 12, 2025
Python

Pantheon-Security / medusa

Star

Multi-language security scanner with 64 analyzers + AI Agent Security. NEW: React2Shell CVE-2025-55182 detection (CVSS 10.0). Scan Python, JS, Go, Rust, Docker, Terraform, MCP & more. 11,500+ downloads. AGPL-3.0.

react python open-source security mcp scanner nextjs static-analysis code-analysis devsecops vulnerability-scanner sast ai-security llm-security cve-detection

Updated Dec 12, 2025
Python

vaniseth / EmpathAI-Trustworthy-Public-Health-Chatbot

Star

A Trustworthy and Secure Conversational Agent for Mental Healthcare

healthcare mental-health trustworthy-ai llm-security prompt-injection-llm-security

Updated Dec 11, 2025
Python

Tencent / AI-Infra-Guard

Star

A.I.G (AI-Infra-Guard) is a comprehensive, intelligent, and easy-to-use AI Red Teaming platform developed by Tencent Zhuque Lab.

agent security benchmark ai mcp scanner jailbreak vulnerability-scanners security-tools red-teaming ai-infra llm llm-security

Updated Dec 11, 2025
Python

trustyai-explainability / llama-stack-provider-trustyai-garak

Star

Out-Of-Tree Llama Stack Eval Provider for Red Teaming LLM Systems with Garak

redteaming responsible-ai llmops llm-security garak

Updated Dec 11, 2025
Python

LostOxygen / llm-confidentiality

Star

Whispers in the Machine: Confidentiality in Agentic Systems

security machine-learning framework deep-learning transformers openai prompt-toolkit gpt confidentiality systems-security llm prompt-engineering chatgpt prompt-injection llm-security

Updated Dec 11, 2025
Python

OWASP / www-project-top-10-for-large-language-model-applications

Sponsor

Star

OWASP Top 10 for Large Language Model Apps (Part of the GenAI Security Project)

ai appsec llm llm-security

Updated Dec 9, 2025
Python

ole2412 / Prompt-Injection-in-LLMs

Star

Implemented and evaluated protection mechanisms to determine their effectiveness against direct prompt injection attacks.

mcp interactive-visualizations guardrails llm-security

Updated Dec 9, 2025
Python

SAP / STARS

Star

AI agent whose purpose is to conduct vulnerability tests on LLMs from SAP AI Core or from local deployments, or models from HuggingFace. The goal of this project is to identify and correct any potential security vulnerabilities.

security ai ai-agents ai-security llm llm-security

Updated Dec 8, 2025
Python

protectai / llm-guard

Star

The Security Toolkit for LLM Interactions

transformers security-tools adversarial-machine-learning large-language-models llm prompt-engineering chatgpt llmops prompt-injection llm-security

Updated Dec 8, 2025
Python

gensecaihq / mcpscc

Star

Security Command Center for Model Context Protocol (MCP) servers. Detect prompt injection, tool poisoning, secrets, and vulnerabilities. The Trivy of MCP security.