De-identify people's names and gender-specific pronouns
Updated May 3, 2025 - Python
Clean your text for statistical ML and language models
This repository contains the code and data for the text re-identification attack presented in B. Manzanares-Salor, D. Sánchez, P. Lison, Evaluating the disclosure risk of anonymized documents via a machine learning-based re-identification attack, Data Mining and Knowledge Discovery, 2024.
This repository contains the code and data for the text anonymization enhancement method presented in B. Manzanares-Salor, D. Sánchez, Enhancing text anonymization via re-identification risk-based explainability, Submitted, 2024.
🛡️ Text Anonymizer using NuNER Zero-shot: an academic project that automatically detects and masks sensitive entities (names, organizations, locations) in .txt and .docx files using zero-shot NER. Features a CLI and a REST API. Replaces data with realistic fakes (Faker) or [REDACTED]. Built with Python, Flask, and GLiNER.
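The masking step described above can be illustrated with a minimal sketch. It is not taken from the repository: the names `EntitySpan`, `mask_entities`, and `find_span` are hypothetical, and the detection step (which the project delegates to a zero-shot NER model) is replaced here by a simple substring lookup so the example stays self-contained. It assumes the detected spans do not overlap.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass(frozen=True)
class EntitySpan:
    """A detected entity: character offsets [start, end) plus a label."""
    start: int
    end: int
    label: str


def mask_entities(
    text: str,
    spans: Iterable[EntitySpan],
    replacer: Optional[Callable[[EntitySpan, str], str]] = None,
) -> str:
    """Replace each span with '[REDACTED]' or with the replacer's output.

    Assumes the spans do not overlap.
    """
    result = text
    # Work right to left so earlier offsets stay valid after each replacement.
    for span in sorted(spans, key=lambda s: s.start, reverse=True):
        original = text[span.start:span.end]
        replacement = replacer(span, original) if replacer else "[REDACTED]"
        result = result[:span.start] + replacement + result[span.end:]
    return result


def find_span(text: str, phrase: str, label: str) -> EntitySpan:
    """Hypothetical stand-in for a NER model: locate a known phrase."""
    start = text.index(phrase)
    return EntitySpan(start, start + len(phrase), label)


sample = "Alice Smith works at Acme Corp in Oslo."
spans = [
    find_span(sample, "Alice Smith", "person"),
    find_span(sample, "Acme Corp", "org"),
    find_span(sample, "Oslo", "location"),
]

# Default behaviour: every entity becomes [REDACTED].
redacted = mask_entities(sample, spans)

# Custom replacer: substitute a label-based placeholder instead.
labelled = mask_entities(sample, spans, lambda span, _: f"<{span.label.upper()}>")
```

The `replacer` callback is where a fake-data generator such as Faker could be plugged in, for example by mapping the `person` label to a generated name. That wiring is left out because it depends on how the project maps entity labels to generators.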