Stars
Robust Speech Recognition via Large-Scale Weak Supervision
Hunt down social media accounts by username across social networks
Get your documents ready for gen AI
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
TradingAgents: Multi-Agents LLM Financial Trading Framework
Portable file server with accelerated resumable uploads, dedup, WebDAV, SFTP, FTP, TFTP, zeroconf, media indexer, thumbnails++ all in one file
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
q - Run SQL directly on delimited files and multi-file sqlite databases
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Printer Exploitation Toolkit - The tool that made dumpster diving obsolete.
Python's missing "algorave" module. Live code music with Python using MIDI, OSC and/or SuperCollider.