GitHub - dorcha-inc/ceil-dlp: Open-Source DLP for LLMs and Agents that provides Conversational Coherence

Open-Source DLP for LLMs and Agentic Workflows with Conversational Coherence

ceil-dlp is a Data Loss Prevention (DLP) plugin for LiteLLM that automatically detects and protects Personally Identifiable Information (PII) in LLM requests. This includes PII in text, images, and PDFs. It blocks, masks, or uses reversible tokens to protect sensitive data before it reaches your LLM provider. This helps prevent you from leaking your secrets, API keys, and other sensitive information. If used with reversible tokens, ceil-dlp allows you to preserve conversational coherence so users can have natural conversations despite the DLP protections. Finally, ceil-dlp also helps you ensure compliance with data privacy regulations like HIPAA, PCI-DSS, GDPR, and CCPA.

Usage

Install ceil-dlp:

pip install ceil-dlp

Download required spaCy models:

python -m spacy download en_core_web_lg
python -m spacy download en_core_web_sm

Note: If using uv, use uv run python instead of python to ensure models are installed in the correct environment.

Then use the CLI to automatically configure LiteLLM:

ceil-dlp install path/to/config.yaml

This command will:

Create a local ceil_dlp_callback.py wrapper in the same directory as your LiteLLM config
Create a starter ceil-dlp.yaml configuration file
Automatically update your LiteLLM config.yaml to include the callback

Then run: litellm --config config.yaml --port 4000

To customize behavior, edit the generated ceil-dlp.yaml file in the same directory as your config.

To remove ceil-dlp from your configuration:

ceil-dlp remove path/to/config.yaml

This will remove the callback from your LiteLLM config. You can also use --remove-callback-file and --remove-config-file flags to remove the generated files.

Documentation

See the Quick Start Guide for a comprehensive, step-by-step tutorial with Ollama
Take a look at the example configuration file for all available options

About

ceil-dlp is an open-source solution that handles both PII + PHI (via Presidio) and secrets (API keys, tokens, credentials, etc.) in one integrated solution, eliminating the need to configure and maintain separate guardrails. ceil-dlp supports model-specific policies using pattern-based rules within a single policy definition, allowing you to configure different rules for different models directly in your configuration file. For example, you can block API keys or PII for an external model provider such as Anthropic or OpenAI while allowing them for locally hosted models. This can be done using simple regex patterns in your config, all without requiring separate guardrail definitions or per-request configuration.

ceil-dlp provides image and PDF support, detecting both PII and secrets in images + pdfs through OCR,. It applies automatically to all requests via LiteLLM's callback system, so you don't need to specify a guardrails parameter on every request It also supports both blocking and masking actions for all detection types, giving you full control over how sensitive data is handled.

An important feature in ceil-dlp is that it can do DLP while preserving conversarional coherence. In addition to masking, blocking, and observinb, ceil-dlp includes a separate DLP mode called whistledown. whistledown (based on our preprint), is designed to preserve conversational context by using consistent, reversible tokens instead of generic redaction markers.

Preserving Conversational Coherence

Traditional masking works something like this:

User: "My name is John Doe and my email is john@example.com"
-> LLM sees: "My name is [REDACTED_PERSON] and my email is [REDACTED_EMAIL]"
-> User sees: "Hello [REDACTED_PERSON]! I'll send details to [REDACTED_EMAIL]"

In whistledown mode, ceil-dlp does the following instead:

User: "My name is John Doe and my email is john@example.com"
-> LLM sees: "My name is PERSON_1 and my email is EMAIL_1"
-> LLM responds: "Hello PERSON_1! I'll send details to EMAIL_1"
-> User sees: "Hello John Doe! I'll send details to john@example.com"

In other words, ceil-dlp masks in a way that maintains a one-to-one correspondence between the original entities and the masked entities. ceil-dlp then automatically reverses the transformations in LLM responses, restoring the original values for the user. This maintains conversational flow while protecting sensitive data from being sent to external LLM providers.

To enable Whistledown mode, set the action to whistledown in your ceil-dlp.yaml configuration:

policies:
  person:
    action: whistledown  # Use consistent tokens instead of [REDACTED_PERSON]
    enabled: true
  
  email:
    action: whistledown
    enabled: true

Note that you can mix actions within the same configuration. For example, using whistledown for person names and emails while using block for credit cards and API keys.

Ensemble Model Architecture

ceil-dlp uses a unique ensemble approach that combines multiple models to maximize PII detection accuracy while minimizing false negatives. The ensemble architecture operates at two levels:

NER Ensemble

You can choose between three configurable detection strength levels. Level 1 uses spaCy's en_core_web_lg model, level 2 additionally uses a transformer-based model (dslim/bert-base-NER), and level 3 uses GLiNER (urchade/gliner_multi_pii-v1) in addition to the models in the previous levels. All models detect PII + PHI + Secrets on the original text independently. The detected results are then merged. This "detect everything first, merge later" approach ensures maximum coverage since each model sees the complete, unredacted text.

OCR Ensemble

For images and PDFs, ceil-dlp uses a sequential multi-pass OCR ensemble. This also has three configurable ocr strength levels. Level 1 uses a lightweight docTR model. Level 2 uses Tesseract as a second model. Finally, level 3 uses a more heavy-weight docTR model. Each OCR engine runs on the previously-redacted image in sequence. This sequential approach is helpful for images because OCR engines can still read surrounding context after redaction (unlike text where redaction destroys information). The intuition behind this is that different OCR engines have different strengths e.g some are better at handwriting, others at printed text etc. and that running multiple passes will catch PII that any single OCR engine might miss.

Existing LiteLLM Guardrails

LiteLLM offers built-in guardrails for many tasks involving LLM interaction security. However, we were unable to find a solution that helps with all the features a person or team working with sensitive data in a real-world LLM interaction would require.

To be more specific, LiteLLM provides two separate guardrails for data protection, each with significant limitations. LiteLLM's Presidio guardrail handles PII and PHI masking using Microsoft Presidio, but it does not handle secrets (API keys, tokens, credentials, etc.). Additionally, it only supports LiteLLM-wide configuration and cannot apply different policies to different models. It also seems to lack support for detecting PII in images and PDFs, only working with text content. LiteLLM's Secret Detection guardrail is an Enterprise-only feature that requires a paid license. While it can detect secrets and can be configured per model (by defining separate guardrail configurations), it only performs redaction and cannot block requests containing secrets. It also only works on text content and does not detect or redact secrets in images or PDFs.

The masking approach of these existing guardrails use generic [REDACTED] tags that break conversational coherence. When the LLM responds with [REDACTED], the user sees the same generic placeholder instead of their original values, making conversations difficult to follow. In contrast, ceil-dlp preserves conversational coherence by using consistent, reversible tokens (like PERSON_1, EMAIL_1) that are automatically converted back to original values in LLM responses, allowing natural conversations while protecting sensitive data.

Contributing

Contributions are always welcome! We'd love to have you contribute to ceil-dlp.

See CONTRIBUTING.md for development setup and guidelines
Read our Code of Conduct to understand our community standards
Check out SECURITY.md for security reporting guidelines

Releasing a New Version

To release a new version of ceil-dlp:

Update the version in pyproject.toml:
```
version = "1.2.0"
```

Commit the version change:

git add pyproject.toml
git commit -m "Bump version to 1.2.0"

Create and push a git tag:

git tag -a v1.2.0 -m "Release v1.2.0"
git push && git push --tags

The GitHub Actions workflow will automatically build the package and publish to PyPI when the tag is pushed

The publish workflow triggers on tags matching v* (e.g., v1.2.0). Make sure your changes are committed and pushed before creating the tag.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.github		.github
.vscode		.vscode
ceil_dlp		ceil_dlp
docs		docs
examples		examples
scripts		scripts
share		share
tests		tests
.editorconfig		.editorconfig
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTORS.md		CONTRIBUTORS.md
LICENSE		LICENSE
MAINTAINERS.md		MAINTAINERS.md
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
config.example.yaml		config.example.yaml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Usage

Documentation

About

Preserving Conversational Coherence

Ensemble Model Architecture

NER Ensemble

OCR Ensemble

Existing LiteLLM Guardrails

Contributing

Releasing a New Version

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Usage

Documentation

About

Preserving Conversational Coherence

Ensemble Model Architecture

NER Ensemble

OCR Ensemble

Existing LiteLLM Guardrails

Contributing

Releasing a New Version

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages