- Datacatessen, LLC
- Baltimore, MD
- https://datacatessen.com
Stars
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
This repository holds the Amazon Elastic MapReduce sample bootstrap actions
A distributed file system implemented in Python
Papers from the computer science community to read and discuss.
The fanciest streaming word count you have ever seen
Repository for MapReduce Design Patterns (O'Reilly 2012) example source code