Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.
Updated Nov 7, 2025 - Ruby
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Secure Vault for Customer PII/PHI/PCI/KYC Records
Secure, high-performance DICOM anonymization and metadata extraction for research and healthcare.
anonymaCy is a spaCy extension for anonymizing PII using rule-based recognizers, context-aware processing, conflict resolution and customizable anonymization.
Ethical AI project for matching candidates and vacancies through semantic NLP analysis. Uses BERT, Word2Vec, and spaCy to ensure fair and explainable recommendations. Developed for the Principles of Artificial Intelligence Technologies (PTIA) course at Escuela Colombiana de Ingeniería Julio Garavito.
A PHP library to back up, restore and anonymize databases
Simple yet powerful tool for identifying and anonymizing personal information in various formats.
Cinnamon is a modular application designed to offer robust functionalities for data anonymization, synthetization, and evaluation.
Maskwise detects, redacts, masks, and anonymizes sensitive data across text, images, and structured data in training datasets for LLM systems. Powered by Microsoft Presidio.
A project on anonymizing mobile customer data. It shows how client data should be kept secure and safe when handled by data analysts.
ARX is a comprehensive open-source data anonymization tool that aims to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks, and well-known privacy models such as k-anonymity, l-diversity, t-closeness, and differential privacy.
Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask, and synthesize PII in text.
Chrome extension to anonymize Excel/CSV HR, payroll & equity comp data — 100% local, zero permissions.
ANJANA is a Python library for anonymizing sensitive data
Generate an anonymized test dataset from production data using configurable anonymization sequences. Runs vendor-agnostic, database-to-database export and import.
My portfolio website, built with Next.js and ShadcnUI. Displays my projects and work experience.
Data anonymization project using ARX: applying k-anonymity with l-diversity and t-closeness to evaluate privacy-utility trade-offs on a sensitive dataset.
De-identify documents to prevent the risk of information leakage.
CSV fuzzer/anonymizer