A fast, accurate API for detecting NSFW images.
Self-Supervised Euphemism Detection and Identification for Content Moderation, IEEE S&P (Oakland) 2021
The Open Source Firewall for LLMs. A self-hosted gateway to secure and control AI applications with powerful guardrails.
AI-powered customer service assistant with guardrails for safe, compliant interactions using an LLM and multiple detector models.
Self-hosted content moderation API that outperforms Amazon Comprehend. 100% offline, your data never leaves your server. Text + Image moderation.
Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).
An ML-driven NSFW moderator for Slack 🚨👮♂️🤖
Middleware library for pydantic-ai agents - before/after hooks for guardrails, logging, rate limiting, PII redaction, content moderation, and input validation.
OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.
Content moderation papers @ ICWSM 2019 & AAAI 2020.
NLP/ViT-driven bot for the detection & moderation of inappropriate content in Telegram groups
LyricMind is an AI-powered cognitive firewall that analyzes song lyrics to protect against unconscious absorption of harmful content. Using customizable frameworks and LLMs, it detects risky patterns and integrates with music players to trigger protective actions when thresholds are exceeded.
A precision reverse-engineering tool for LLM input-side censorship. Automatically pinpoints blocked keywords in NewAPI, OneAPI, and any API gateway enforcing dictionary-based prompt filtering.
NudeDetect is a Python-based tool for detecting nudity and adult content in images. This project combines the capabilities of the NudeNet library, EasyOCR for text detection, and the Better Profanity library for identifying offensive language in text.
Python SDK
An AI-powered content moderation system using Python and Hugging Face Transformers. Combines rule-based filtering and machine learning to detect and block toxic, profane, and politically sensitive content, built for developers and communities to create safer, positive online spaces.
Trying to make sense of the EU's DSA Transparency DB
Real-time ML agent that detects sensitive content in video streams for classroom, parental, and enterprise safety.
A package that analyzes user-submitted text about avoiding a negative or unwelcome appearance on a Louis Rossmann video, identifying key factors and common pitfalls in the input.
A simple toxicity detector.