- Paris
Highlights
- Pro
Stars
- All languages
- ANTLR
- Astro
- Batchfile
- C
- C#
- C++
- CQL
- CSS
- Clojure
- Cypher
- Cython
- Dockerfile
- Elixir
- GLSL
- Gleam
- Go
- Go Template
- Groovy
- HCL
- HTML
- Haskell
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- LookML
- MDX
- Makefile
- Markdown
- Mustache
- Nextflow
- OCaml
- Objective-C
- PHP
- PLSQL
- PLpgSQL
- Pascal
- Perl
- Procfile
- Python
- R
- Rich Text Format
- Ruby
- Rust
- SCSS
- SQL
- Scala
- Scheme
- Shell
- Starlark
- Svelte
- Swift
- TeX
- TypeScript
- Vala
- Vue
- Web Ontology Language
- XSLT
- YAML
A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts
Example repository for managing Clickhouse schemas and migrations with declarative and versioned workflows
Finally, a good FUSE FS implementation over S3
Cross-platform, customizable ML solutions for live and streaming media.
Interfaces to query ClickHouse databases from PostgreSQL
LLM Agent skills for working with DataHub, search, enrich, quality, build connectors, ...
An Open Standard for lineage metadata collection
A extension for DuckDB, which captures lineage events for executed queries
k3s cluster managed by FluxCD GitOps
Multi-vendor, format-agnostic parser for sequencing sample sheets - Illumina IEM V1 & BCLConvert V2 plus Element AVITI run manifests, with index validation and color-balance checking
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tables. It powers Elementary OSS and feeds the wider context la…
Always know what to expect from your data.
A Deep Learning Python Toolkit for Healthcare Applications.
[Under development] A dbt ETL project to convert a Synthea synthetic data set into the OMOP CDM
A resource to convert the MIMIC-IV and MIMIC-IV note datasets into a standard OMOP format.
This is the development home of the workflow management system Snakemake. For general information, see
Apache Doris is an easy-to-use, high performance and unified analytics database.
Python SDK for OMOP/OHDSI vocabularies - query 10M+ medical concepts across SNOMED, ICD-10, RxNorm, LOINC & 90+ terminologies via simple API
Rivers is an orchestration platform for data and ML pipelines, written in Rust for native performance with a Python-first development experience.
Karapace - Your Apache Kafka® essentials in one tool
Karoo companion app for Japanese electronic shifting groupsets. Display gear/battery information and control Karoo ride screen.
DuckDB is an analytical in-process SQL database management system
Differential expression of RNA-seq data using the Negative Binomial
Aggregate results from bioinformatics analyses across many samples into a single report.
📙 Awesome Data Catalogs and Observability Platforms.