24 Oct 25
WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV and Parquet files, store them in tables, and run SQL queries on the data. WhatTheDuck is a Python library available on GitHub that serves as a high-performance bridge for data transfer and integration between the DuckDB analytical database and Pandas DataFrames.
by tmfnk
2 months ago
14 Jul 25
20 Apr 25
An open-source tool for reading OpenStreetMap PBF files using DuckDB.
- Scalable reader for OpenStreetMap ProtoBuffer (pbf) files.
- Built on top of DuckDB with its Spatial extension.
- Saves files in the GeoParquet file format for easier integration with modern cloud stacks.
- Uses multithreading, unlike GDAL, which works in a single thread only.
- Can filter data based on geometry without the need for ogr2ogr clipping before operation.
- Can filter data based on OSM tags.
- Uses caching to avoid repeated computations.
- Can be used as a Python module as well as a beautiful CLI based on Typer.
by agnieszka
8 months ago
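
To make the tag-based filtering mentioned in the OpenStreetMap reader entry more concrete, here is a minimal sketch in plain Python. It is not the tool's actual API: the `TagsFilter` format, the `matches_tags` helper, and the sample features are all hypothetical, chosen only to illustrate the idea of keeping features whose OSM tags match a filter.

```python
from typing import Union

# Hypothetical filter format (an assumption, not the tool's documented one):
# key -> True (any value), a single value, or a list of allowed values.
TagsFilter = dict[str, Union[bool, str, list[str]]]


def matches_tags(tags: dict[str, str], tags_filter: TagsFilter) -> bool:
    """Return True if the feature's tags satisfy at least one filter entry."""
    for key, wanted in tags_filter.items():
        if key not in tags:
            continue
        if wanted is True:
            # Any value is accepted for this key.
            return True
        if isinstance(wanted, str) and tags[key] == wanted:
            return True
        if isinstance(wanted, list) and tags[key] in wanted:
            return True
    return False


# Made-up sample features standing in for parsed OSM elements.
features = [
    {"id": 1, "tags": {"building": "yes"}},
    {"id": 2, "tags": {"amenity": "cafe", "name": "Example"}},
    {"id": 3, "tags": {"highway": "residential"}},
    {"id": 4, "tags": {"amenity": "bank"}},
]

tags_filter: TagsFilter = {"building": True, "amenity": ["cafe", "bar"]}
selected_ids = [f["id"] for f in features if matches_tags(f["tags"], tags_filter)]
print(selected_ids)
```

In this sketch a feature is kept as soon as any one filter entry matches, so buildings of any kind and cafes or bars pass, while roads and banks are dropped.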