Reference Architectures for Datalakes on AWS
Updated May 13, 2020 - HTML
An idiomatic Kotlin dataframe toolkit for data engineering tasks on datasets of any size
📢🚨📣 Sciscinet-v2 is a refreshed update to SciSciNet, a large-scale, integrated dataset designed to support research in the science of science domain.
This project is the MVP developed as a graded requirement for the Data Engineering course (code 40530010057_20250_02), part of the Specialization in Data Science and Analytics program at the Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio).
Develop the expertise to architect modern data ecosystems. Learn advanced database design, data modeling, cloud integration, and governance to deliver secure, efficient, and scalable enterprise data solutions.
In this Human Resources Analytics project, we aim to answer key questions about talent management and employee turnover at a fictional company.
Provenance-first cadmium data lake with reproducible DuckDB and Parquet outputs.