Stars
An end to end semantic and meta-data search engine for personal data.
Self-contained worked examples of Apache Lucene features and functionality
A simple Java library for interacting with Ollama server.
The most widely used, high performance Minecraft server that aims to fix gameplay and mechanics inconsistencies
Flink Agents is an Agentic AI framework based on Apache Flink
🔍 Unified Search MCP Server - Search across Google Scholar, Web, and YouTube with a single query
A client for connecting and running DDLs on hive metastore.
A tool for visually designing and inspecting Elasticsearch index structures.
🔥 Seata is an easy-to-use, high-performance, open source distributed transaction solution.
The codebase for the book "AI-Powered Search" (Manning Publications, 2025)
Monolingual wordlists with pronunciation information in IPA
Mail Connector for Apache Beam / Google Cloud Dataflow
Event Driven Orchestration & Scheduling Platform for Mission Critical Applications
🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
Elasticsearch in Action Book
Example on how to deploy Apache beam, Spark Cluster on Kubernetes and run Python code
An artifact of fully-specified annotations to power static-analysis checks, beginning with nullness analysis.
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
Apache Fluss is a streaming storage built for real-time analytics.
Integrates LLMs as PTransform in Apache Beam pipelines using LangChain
An Apache Beam source to connect and consume data from TREP using the Websocket API.
The Dataflow Solution Guides offer full end-to-end deployment for the most common streaming solutions to run on Dataflow.