Lightweight PII redaction pipeline using Hugging Face NER + regex (Python) 96.5% accuracy
-
Updated
Dec 9, 2025 - Python
Lightweight PII redaction pipeline using Hugging Face NER + regex (Python) 96.5% accuracy
AutoRML: A framework for automatic RML mapping generation using semantic table annotations
Turn your unstructured documents (PDFs, text files, code, XML) into powerful, interactive knowledge graphs using the magic of Large Language Models (LLMs)! KnowledgeLens AI helps you understand complex information, discover hidden connections, and chat with your data.
Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules
Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources
TorchicTab-Heuristic: Semantic Table Annotation with Wikidata
Intelligent classification system for internet entities (domains, companies, services) using rule-based labeling, machine learning and LLM verification
^[S]\s$NLP(^\[Super\]\s$Natural Language Processing) Note. https://en.wikipedia.org/wiki/Natural_language_processing
AI-powered interview transcript analysis platform built with FastAPI, LangGraph, and Next.js 15. Transform interview transcripts into actionable insights using Claude AI with timeline extraction, entity recognition, and sentiment analysis.
A Python library for generating and loading synthetic and real-world datasets tailored for graph-based applications.
omgdevelopment.org site source
A Streamlit web application that demonstrates Named Entity Recognition (NER) for news articles using spaCy models trained on the CoNLL-2003 dataset. The app allows users to input custom text and visualize recognized entities (Person, Organization, Location, Date, Misc) with both highlighted text and entity lists.
SentimentInsights is a Ruby gem for extracting actionable insights from qualitative survey responses. It provides sentiment analysis, key phrase extraction, and named entity recognition using multiple NLP providers including OpenAI, Claude and AWS Comprehend.
This NLP project focuses on surfacing globally relevant and important news, filtering out noise, and making it engaging and accessible through humor and intelligent summarization.
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
AI-powered document extractor for names, emails, and organizations. License: MIT
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
A Java-based AI Personal Assistant with NLP capabilities, including sentence detection, named entity recognition (NER), and chatbot interaction.
This repository contains the code for the paper "PoliToHFI at SemEval-2023 Task 6"
DIET Classifier mini implementation on pytorch.
Add a description, image, and links to the entity-recognition topic page so that developers can more easily learn about it.
To associate your repository with the entity-recognition topic, visit your repo's landing page and select "manage topics."