Stars
Production-grade Arrow FlightSQL gateway in front of DuckDB Quack + DuckLake. Multi-tenant pools, pluggable auth (DB/JWT/OIDC), table-level ACLs, role-aware routing, and a live admin console
SQLGLot vs. JSQLParser speed comparison
from vibe coding to agentic engineering - practice makes claude perfect
developers.events is a community-driven platform listing developer/tech conferences and Calls for Papers (CFPs) worldwide with a list, a calendar and a map view. It helps organizers, speakers, spon…
Privacy-first API platform built with Tauri v2. No login, no cloud, ~60 MB RAM. A lightweight Postman alternative.
A modular Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.
A script from Mike O'Driscoll to toggle Tailscale exit nodes from a GL.iNet physical switch.
Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.
BigTesty is a framework that allows to create Integration Tests with BigQuery on a real and short lived Infrastructure.
SoftClient4ES is a modular and version-resilient interface built on top of Elasticsearch clients, providing a unified and stable API that simplifies migration across Elasticsearch versions, acceler…
Surfalytics projces on Data Engineering and Analytics
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
The best place to learn data engineering. Built and maintained by the data engineering community.
This is a public repository to go over all the LLM-driven data engineering concepts.
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
Create a Chatbot app on your own data with GCP tools
hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
Create BigQuery views that unify sets of table with the same prefix and different versions.
Introduction à la science des données et à l’intelligence artificielle
A curated list of resources for learning about Google Cloud Platform certifications and how to prepare for it.
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
by ex-googlers, for ex-googlers - a lookup table of similar tech & services