BE
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An Open Standard for lineage metadata collection
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
Apache DolphinScheduler is the modern data orchestration platform. Create high-performance workflows quickly with low code
Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …
Realtime Web Apps and Dashboards for Python and R
Apache Beam is a unified programming model for Batch and Streaming data processing.
Always know what to expect from your data.
Roadmap to becoming a data engineer in 2021
lakeFS - Data version control for your data lake | Git for data
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
Next-generation ORM for Node.js & TypeScript | PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, MongoDB and CockroachDB
A type-safe Postgres query builder for TypeScript.
Swagger UI is a collection of HTML, JavaScript, and CSS assets that dynamically generate beautiful documentation from a Swagger-compliant API.
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
An SWT based API for managing users and issuing SWT tokens.
🔥 🔥 🔥 Open Source Airtable Alternative
📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs