Skip to main content
search

OpenSearch Core

OpenSearch Core is a high-performance search engine built on the robust indexing and search capabilities of Apache Lucene.

It allows you to ingest large volumes of content in diverse formats using OpenSearch Data Prepper, index complex multidimensional data, and perform efficient searches that return highly accurate results.

You can then explore and visualize your data seamlessly with OpenSearch Dashboards.

A powerful and versatile search and analytics engine

Robust search and indexing based on Apache Lucene

Powerful, open, designed to scale

OpenSearch is a fast, flexible, open-source suite for search and analytics. From log data to real-time monitoring, it delivers enterprise-grade features like security, alerting, and machine learning—without vendor lock-in. Its distributed design scales easily and keeps you in control.

Built on Apache Lucene

 Apache Lucene™ is a high-performance, full-featured search engine library written entirely in Java. It is a technology suitable for nearly any application that requires structured search, full-text search, faceting, nearest-neighbor search across high-dimensionality vectors, spell correction or query suggestions.

OpenSearch Core components

Fault tolerant, scalable components

The OpenSearch Core architecture is made up of clusters, nodes, indexes, shards, and documents. At the top level is the OpenSearch cluster, a distributed network of nodes, each responsible for different cluster operations based on its type. Data nodes are responsible for storing indexes—logical groupings of documents—and handling tasks like data ingestion, search, and aggregation.

Each index is divided into shards, which include both primary and replica data. Shards are distributed across multiple machines, enabling horizontal scaling for improved performance and efficient use of storage.

OpenSearch’s sharding strategy

A diagram of OpenSearch clusters

OpenSearch Clusters are highly scalable and exceptionally resilient.

Scalable by design

 Handle growing data volumes effortlessly with horizontal scaling and distributed architecture.

Open and flexible

 Fully open-source with support for custom plugins, fine-tuned queries, and community-driven innovation.

Real-time insights

 Run lightning-fast indexing, searches, and analytics on streaming or historical data for instant visibility.

Secure and extensible

 Built-in security features, role-based access control, audit logging, and compliance.

Key features

Lexical search

Perform traditional keyword-based queries using the Okapi BM25 algorithm.

Semantic search

Incorporate text embedding models and advanced vector search capabilities for interpreting the meaning and context of queries.

Hybrid search

Combine keyword and semantic search to improve search relevance and accuracy.

Vector engine

Execute k-NN searches and manage vector data alongside other data types with OpenSearch Core’s integrated vector database.

Intuitive dashboards

Render your search data and use natural language instructions to create visualizations with OpenSearch Dashboards.

Dynamic scaling

Supports enterprise workloads using both horizontal and vertical scaling, as well as native vector capabilities that can handle billions of vectors.

Extensive plugins

Add new features and capabilities to OpenSearch core using its expansive library of pre-built plug-ins, with more added on a regular basis.

Benefits

Established and trusted

OpenSearch Core is built on Apache Lucene’s scalable, high-performance search library, continuously updated to meet evolving performance requirements.

Flexible and adaptable

OpenSearch Core is highly customizable, making it adaptable to a diverse range of search applications and AI use cases.

Extensible and open

 With dozens of plugins available, and more supported with new releases, you can easily add new features or build your own.

Getting started

Check out the installation quickstart to get started using OpenSearch right away.

Access numerous bundled plugins or install additional plugins to customize your OpenSearch platform.

Review the OpenSearch performance benchmarks to view the results of ongoing performance testing.

Most recent OpenSearch blog posts

November 26, 2025 in Blog

Asymmetric distance computation for binary quantization

Learn how asymmetric distance computation (ADC) in OpenSearch achieves 32-fold memory compression for vector search while maintaining better recall than traditional binary quantization methods.
Read More
November 25, 2025 in Blog

Better observability, deeper insights: OpenSearch’s new Piped Processing Language capabilities

OpenSearch 3.3, PPL, Piped Processing Language, log analytics, observability, Apache Calcite, data analysis, time-series analysis, log processing, data visualization, OpenSearch 3.3, database querying, data monitoring, security analytics
Read More
November 24, 2025 in Blog

Introducing agentic search in OpenSearch: Transforming data interaction through natural language

With OpenSearch 3.3, we are excited to introduce _agentic search_, a major evolution in the way you interact with your data. Agentic search lets you explore your data using a…
Read More
November 21, 2025 in Blog

Adaptive refresh for resilient segment replication

Segment replication is a powerful index replication strategy for high QPS workloads that allows physical isolation and decoupling of indexing and search workloads, rapid replica failovers, and seamless point-in-time restores.…
Read More
November 21, 2025 in Error Logs

Error log: InvalidIndexNameException – The Illegal Name

Error log: You will see this error when you try to create a new index with a name that doesn't follow the rules. {   "error" : {     "root_cause" : ,…
Read More
November 21, 2025 in Error Logs

Error log: Unassigned Shards – Why Your Cluster is Showing Yellow or Red

Error Log: You won't typically see "unassigned_shards" directly in your log files as an error message. Instead, this is a status indicator you'd find when checking your cluster health: JSON…
Read More