llm-security

Star

Here are 101 public repositories matching this topic...

jmassengille / Parseon

Star

Security scanner for code and logs of AI-powered applications

python cli static-analysis security-tools ai-security sbom cyclonedx llm-security

Updated Jul 14, 2025
Python

dewittgibson-kpmg / LLMGuardian

Star

Comprehensive LLM AI Model protection - cybersecurity toolset aligned to addressing OWASP vulnerabilities - https://genai.owasp.org/llm-top-10/

cybersecurity owasp-top-10 llm-security ai-security-tool

Updated Jan 28, 2025
Python

juyterman1000 / llm-safety

Star

Stop prompt injections in 20ms. The safety toolkit every LLM app needs. No API keys, no complex setup, just `pip install llm-guard` and you're protected.

Updated Aug 23, 2025
Python

josephedward / SemFire

Star

Now in Org Repo ⬇️

semantic-analysis llm-security

Updated Oct 21, 2025
Python

awesome-software / llm-guard

Star

The Security Toolkit for LLM Interactions

llm-security

Updated Sep 21, 2023
Python

schoi1337 / m2m-bypass-sim

Star

Simulating prompt injection and guardrail bypass across chained LLMs in security decision pipelines.

ai-security prompt-injection llm-security threat-simulation guardrail-bypass

Updated Dec 5, 2025
Python

musabdulai-io / ai-security-scanner

Star

Security scanner for LLM/RAG applications - Test for prompt injection, jailbreaks, PII leakage, hallucinations & more

cli security jailbreak owasp penetration-testing red-team ai-safety vulnerability-scanner ai-security rag fastapi pii-detection llm prompt-injection llm-security

Updated Dec 14, 2025
Python

StrategicPromptArchitect-AI / MalPromptSentinel-CC-Skill

Star

MalPromptSentinel (MPS) is a Claude Code skill that detects malicious prompts in uploaded files before Claude processes them. It provides two-tier scanning to identify prompt injection attacks, role manipulation attempts, privilege escalation, and other adversarial techniques.

ai-security llm-security prompt-security claude-skill prompt-injection-detection malicious-prompt-scanner ai-prompt-protection

Updated Nov 27, 2025
Python

Hyperceptron / SemFire

Star

Agentic Defense Toolkit

semantic-analysis policy-engine llm-security

Updated Nov 14, 2025
Python

guardrail-labs / llm-guardrail-api-next

Star

Open-source enforcement layer for LLM safety and governance — ingress/egress evaluation, policy packs, verifier support, and multimodal protection.

ai-safety policy-engine fastapi complaince prompt-injection llm-security llm-firewall guardrail-api

Updated Nov 29, 2025
Python

metawake / puppetry-detector

Star

**Puppetry Detector** is a modular engine for detecting structured and semantic prompt injection attacks against Large Language Models (LLMs).

python nlp nlp-machine-learning ai-security large-language-models llm prompt-injection llm-security llm-moderation

Updated Apr 27, 2025
Python

jesushova / LLM-Prompt-Injection-Security

Star

Research and defense implementation for prompt injection vulnerabilities in LLM applications

python owasp cybersecurity owasp-top-10 ai-security prompt-injection llm-security ai-security-testing secureflag

Updated Oct 17, 2025
Python

jacklatrobe / MCP-Guardian

Star

MCP Guardian acts as a proxy service for remote MCP endpoints, and constantly polls them to make sure they haven't been compromised or modified.

security mcp prompt-injection llm-security mcp-server mcp-client

Updated Oct 30, 2025
Python

MacTash / HF-scanner

Star

A cross-provider AI model security scanner that evaluates HuggingFace, OpenRouter, and Ollama models for malicious content, unsafe code, license issues, and known vulnerabilities. Includes automated reports and risk scoring.

risk-analysis cybersecurity vulnerability-scanners security-scanner ai-security license-compliance huggingface risk-scoring metadata-analyzer metadata-analysis cybersecurity-tools ml-security llm-security ai-security-tool ai-red-teaming license-compliant-ai model-auditing model-security

Updated Nov 26, 2025
Python

matthernet / LLM-security-check

Star

CLI tool that uses the Lakera API to perform security checks in LLM inputs

ai artificial-intelligence ai-security large-language-models llm llm-security

Updated Mar 13, 2024
Python

awesome-software / llm-attacks

Star

Universal and Transferable Attacks on Aligned Language Models

llm-security

Updated Sep 19, 2023
Python

sookoothaii / llm-security-firewall

Star

Bidirectional LLM security firewall providing risk reduction (not complete protection) for human/LLM interfaces. Hexagonal architecture with multi-layer validation of inputs, outputs, memory and tool state. Beta status. ~528 KB wheel, optional ML guards.