Stars
Free and Open Source, Distributed, RESTful Search Engine
A browser automation framework and ecosystem.
Logstash - transport and process your logs, events, or other data
OpenRefine is a free, open source power tool for working with messy data and improving it
Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence.
Algorithms, 4th edition textbook code and libraries
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data
Solutions to all the exercises in the Algorithms book by Robert Sedgewick and Kevin Wayne
Source code for Big Data: Principles and best practices of scalable realtime data systems