Stars
QGIS is a free, open-source, cross-platform (Linux/Windows/macOS) geographical information system (GIS)
🍺 The missing package manager for macOS (or Linux)
An Open Source Machine Learning Framework for Everyone
Apache Spark - A unified analytics engine for large-scale data processing
A framework for building native applications using React
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Kepler.gl is a powerful open source geospatial analysis tool for large-scale data sets.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
A browser automation framework and ecosystem.
LlamaIndex is the leading document agent and OCR platform
Declaratively deploy your Kubernetes manifests, Kustomize configs, and Charts as Helm releases. Generate all-in-one manifests for use with ArgoCD.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
Apache Superset is a Data Visualization and Data Exploration Platform
The official home of the Presto distributed SQL query engine for big data
Streamlit — A faster way to build and share data apps.
An extremely fast Python package and project manager, written in Rust.
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
Always know what to expect from your data.
Universal Command Line Interface for Amazon Web Services
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs