-
unytics
- Paris
- https://www.linkedin.com/in/paul-marcombes
- in/paul-marcombes
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Event Driven Orchestration & Scheduling Platform for Mission Critical Applications
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
The Metadata Platform for your Data and AI Stack
Kafka GUI for Apache Kafka to manage topics, topics data, consumers group, schema registry, connect and more...
An Open Standard for lineage metadata collection
The open source, cloud native tool for API Mocking and Testing. Microcks is a Cloud Native Computing Foundation sandbox project 🚀
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Retail banking sample application showcasing Kubernetes and Google Cloud
Abixen Platform is a microservices based software platform for building enterprise applications delivering functionalities through creating particular microservices and integrating by provided CMS.
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
Cascading / cascading
Forked from cwensel/cascadingAll development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on vario…
Schema modelling framework for decentralised domain-driven ownership of data.
Examples of how to use Cloud Bigtable both with GCE map/reduce as well as stand alone applications.
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Utility to identify and rewrite common anti patterns in BigQuery SQL syntax
CATA.Search. Blockchain database, cata metadata query
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.
Official repository of SquashQL, the SQL query engine for multi-dimensional and hierarchical analysis that empowers your SQL database
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
A collection of Google Cloud Platform (GCP) plugins
The ZetaSQL Toolkit is a library that helps users use ZetaSQL Java API to perform SQL analysis for multiple query engines, including BigQuery and Cloud Spanner.