-
Databricks
- Berlin
- https://baunsgaard.github.io/
- https://orcid.org/0009-0001-1463-7294
Highlights
- Pro
Stars
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
TerseTS is a collection of lossless and lossy time series compression methods.
A LaTeX template for a basic DFG (Deutsche Forschungsgemeinschaft, German Research Foundation) grant proposal.
Apache Spark - A unified analytics engine for large-scale data processing
Feature Clock: High-Dimensional Effects in Two-Dimensional Plots
An open source ML system for the end-to-end data science lifecycle
✂️ Fast slice finding for Machine Learning model debugging.
Baunsgaard / systemds
Forked from apache/systemdsMirror of Apache SystemML
DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines
An implementation of a variant of the MCxxxx programming language from Shenzhen I/O™
JDK main-line development https://openjdk.org/projects/jdk
BI benchmark with user generated data and queries