Skip to content
View cboden's full-sized avatar
📉
Munging data
📉
Munging data

Organizations

@reactphp @ratchetphp

Block or report cboden

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python 2,212 242 Updated Jun 19, 2026

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 3,147 403 Updated Jun 18, 2026

Visualize and share your data. All in SQL. Powered by DuckDB.

Go 1,136 42 Updated Jun 15, 2026

A sleek, single-page React application that transforms JSON data into an interactive and visually appealing directed graph using React Flow.

JavaScript 23 1 Updated Jul 5, 2025

pg_lake: Postgres with Iceberg and data lake access

C 1,552 104 Updated Jun 19, 2026

PostgreSQL replication with DDL changes

Go 1,139 60 Updated Jun 18, 2026

Automated database platform for PostgreSQL® - Your own DBaaS.

TypeScript 4,269 594 Updated Jun 15, 2026

A collaborative note taking, wiki and documentation platform that scales. Built with Django and React.

Python 16,598 599 Updated Jun 19, 2026

The dbt-toolkit is an early-stage plugin designed to enhance your experience working with dbt-core projects in JetBrains IDEs.

Kotlin 34 Updated Mar 4, 2026

Document, sample code and other materials for SQLFlow

Python 1,038 191 Updated Apr 6, 2026

Code-playground to visualise complex engineering flows.

TypeScript 427 30 Updated Oct 8, 2024

🧠 Cognitive load is what matters

12,290 295 Updated Jun 13, 2026

An AI-powered Personal Identifiable Information (PII) scanner.

Python 732 63 Updated Jan 22, 2025

The API to search, scrape, and interact with the web at scale. 🔥

TypeScript 134,932 7,863 Updated Jun 19, 2026

Open-source platform for extracting structured data from documents using AI.

JavaScript 1,481 60 Updated May 15, 2025

Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake

Java 322 69 Updated Jun 15, 2026

There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…

TypeScript 69,565 4,945 Updated Jun 19, 2026

📙 Awesome Data Catalogs and Observability Platforms.

1,042 78 Updated Aug 14, 2025

Entity Relation Diagrams generation tool

Python 1,416 132 Updated May 5, 2026

Database diagrams editor that allows you to visualize and design your DB with a single query.

TypeScript 22,411 1,417 Updated Jun 18, 2026

A curated list of data engineering tools for software developers

8,747 1,546 Updated Jun 16, 2026

Temporal service

Go 21,073 1,672 Updated Jun 19, 2026

JavaScript library for working with recurrence rules for calendar dates as defined in the iCalendar RFC and more.

TypeScript 3,720 548 Updated Jun 27, 2024

Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.

Rust 9,343 885 Updated Jun 18, 2026

The data-validation toolkit for enhanced dbt (data build tool) PR review

TypeScript 460 26 Updated Jun 19, 2026

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

Python 345 99 Updated Jan 5, 2024

Collect, aggregate, and visualize a data ecosystem's metadata

Java 2,217 403 Updated Jun 18, 2026

Generate the ERD as a code from dbt artifacts

Python 329 34 Updated Jun 14, 2026
Next