Starred repositories
Command-line program to download videos from YouTube.com and other video sites
Financial data platform for analysts, quants and AI agents.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
scikit-learn: machine learning in Python
The simplest, fastest repository for training/finetuning medium-sized GPTs.
LlamaIndex is the leading document agent and OCR platform
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
The fundamental package for scientific computing with Python.
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Turn (almost) any Python command line program into a full GUI application with one line
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
HTTP Request & Response Service, written in Python + Flask.
Static site generator that supports Markdown and reST syntax. Powered by Python.
A vector index built on TurboQuant, written in Rust with Python bindings
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.