Stars
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to clo…
Get your documents ready for gen AI
💫 Industrial-strength Natural Language Processing (NLP) in Python
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
Modin: Scale your Pandas workflows by changing a single line of code
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Automated Machine Learning with scikit-learn
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
A Python implementation of LightFM, a hybrid recommendation algorithm.
Glue is a simple command line tool to generate CSS sprites
Sixpack is a language-agnostic A/B testing framework
[UNMAINTAINED] Automated machine learning for analytics & production
Asynchronous parallel SSH client library.
db.py is an easier way to interact with your databases
PySpark + Scikit-learn = Sparkit-learn
A security scanner for custom LLM applications
A collection of anomaly detection methods (iid/point-based, graph and time series) including active learning for anomaly detection/discovery, Bayesian rule-mining, description for diversity/explana…
A library for Partially Homomorphic Encryption in Python
The world's cleanest AutoML library ✨ - Do hyperparameter tuning with the right pipeline abstractions to write clean deep learning production pipelines. Let your pipeline steps have hyperparameter …
Accelerated Excel XLSX Writing Library for Python 2/3
Simple Python client for interacting with Google BigQuery.