Stars
Free and Open Source, Distributed, RESTful Search Engine
Free universal database tool and SQL client
Learn System Design concepts and prepare for interviews using free resources.
A browser automation framework and ecosystem.
Apache Kafka - A distributed event streaming platform
🔎 Open source distributed and RESTful search engine.
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Apache JMeter: an open-source load testing tool for analyzing and measuring the performance of a variety of services
An Application Framework for AI Engineering
Pentaho Data Integration (ETL), a.k.a. Kettle
Tutorials for using RabbitMQ in various ways
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Machine learning software for extracting information from scholarly documents
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Open, Multi-modal Catalog for Data & AI
Hopsworks - Data-Intensive AI platform with a Feature Store
Mondrian is an Online Analytical Processing (OLAP) server that enables business users to analyze large quantities of data in real time.
Low Level Designs of common data structures. These designs keep concurrency control, latency and throughput in mind. We use design patterns where applicable to make the code readable, extensible an…