Data architecture

< Back to all guides
An Overview of Source Filtering, Stored Fields, Fields and Docvalues Fields

There are various methods for retrieving fields in Elasticsearch, including: _source, stored_fields, fields & docvalue_fields. To retrieve...

Creating an Index with Mapping in Elasticsearch

In this article, we will delve into the process of creating an index with mapping in Elasticsearch. Mapping is the process of defining how...

Data Prepper vs. Logstash – A Complete Comparison

In this guide, we'll compare Logstash, the flagship ingestion tool of the ELK & Data Prepper, OpenSearch’s ingestion tool response.

Elasticsearch Cardinality - Low + High Cardinality Fields

In this guide, we'll go over Elasticsearch cardinality, high & low cardinality fields, how to determine field cardinality, and give examples.

Elasticsearch Change Field Type: A Comprehensive Guide

This article guides you through the process of changing the field type in Elasticsearch, including the reasons for doing so...

Elasticsearch Compression

Elasticsearch provides various compression techniques to optimize storage and network usage. Here's how to compress an index, source and...

Elasticsearch Data Stream

The Elasticsearch data stream is an abstraction layer between the names used by applications to facilitate ingestion and search operations...

Elasticsearch Delete Document

In this guide, we will discuss the process of deleting documents in Elasticsearch as well as best practices for deleting documents.

Elasticsearch doc-values-only Fields

This guide explains the purpose of the doc values structure and shows examples of usage of the Elasticsearch doc-values-only fields.

Elasticsearch Document Size Limit

This guide discusses the default document size limit in Elasticsearch, the reasons behind it, and how to handle larger documents effectively.

Elasticsearch Dynamic Mapping: Advanced Insights and Best Practices

In this article, we will delve into the advanced aspects of dynamic mapping, its benefits, and best practices for optimizing its usage.

Elasticsearch Field Size - How to Calculate the Storage Size of Specific Fields in an Index 

The 3 main methods in Elasticsearch to calculate the storage size of specific fields in an index are: using the _disk_usage API, creating...

Elasticsearch Filtered Aliases

Elasticsearch filtered aliases can help you filter data more efficiently. To create a filtered alias, you need to define the criteria for...

Elasticsearch Global Ordinals, Eager Global Ordinals & High Cardinality Fields

Terms aggregations rely on an internal data structure known as global ordinals. The eager_global_ordinals parameter is used to...

Elasticsearch Hot Warm

This article will discuss the benefits of using a hot-warm architecture and provide a step-by-step guide to setting up...

Elasticsearch Index Alias

Learn step-by-step techniques for creating, managing, and leveraging Elasticsearch's index aliases effectively.

Elasticsearch Index Pattern

Elasticsearch index patterns allow you to define how to match & interact with multiple indices. The best practices for index pattern usage...

Elasticsearch Index Settings

Discover how to use Elasticsearch index settings. Learn the difference between static and dynamic settings, methods for updating them & more.

Elasticsearch Insert Document

This guide will discuss best practices & performance optimization techniques for inserting documents into Elasticsearch. First, use the API...

Elasticsearch Json Array

Learn how to use JSON Arrays in Elasticsearch with this comprehensive guide on Indexing, Querying, and Dealing with Nested JSON Structures.

Elasticsearch match_only_text Field Type (For Storage Optimization)

The new match_only_text feature in Elasticsearch can save up to 10% of disk space on logging datasets. This field type will set a flat...

Elasticsearch Nested Aggregation 

Elasticsearch nested aggregation is a powerful technique for analyzing complex data structures that contain nested documents. To use...

Elasticsearch Not Indexed Field

This guide focuses on how to handle not indexed fields in Elasticsearch, including use cases, how to configure them, and examples.

Elasticsearch Null Value

Learn how to filter and find null values in Elasticsearch when searching for documents where the field is null

Elasticsearch Searchable Snapshots

Elasticsearch searchable snapshots allow data exploration. They can be controlled with ILM Policies or be manually mounted. To implement...

Elasticsearch Split Index API

By using the Elasticsearch Split Index API, an existing index can be split to create a new index with extra primary shards. To do this...

Elasticsearch Stop Words

This guide will explore the use of stop words in Elasticsearch, providing examples and step-by-step instructions.

Elasticsearch Stored Fields

Learn how stored fields differ from _source field, how to configure them for specific use-cases, their advantages and limitations.

Elasticsearch Sum Aggregation

The Elasticsearch sum aggregation allows you to calculate the sum of a numeric field for a set of documents. To implement...

Elasticsearch Synonym Token Filter

A tokenizer decides how Elasticsearch will take a set of words and divide it into separated terms called “tokens”. To work with synonyms...

Elasticsearch Term Vectors

Learn how to enable and retrieve Term Vectors, and explore use-cases in information retrieval, text classification, and more.

Elasticsearch Text Analyzers - Tokenizers, Standard Analyzers, Stopwords and More

The text analysis process is tasked with two functions: tokenization and normalization and is carried out by employing analyzers. When you...

Elasticsearch Time Series Data Stream (TSDS) 

Elasticsearch Time Series Data Stream (TSDS), is an optimized way to store time-based metrics. To Implement Time Series Data Streams...

Elasticsearch Transform APIs

The Elasticsearch Transform APIs can be used to turn existing indices into summarized indices. To create a transform API...

Elasticsearch Upsert

Upsert in Elasticsearch allows you to perform both update and insert actions in a single request. To perform an upsert operation...

Excluding Elasticsearch Fields from Indexing

This article will discuss the reasons for excluding fields from indexing, how to configure Elasticsearch to exclude specific fields...

Handling Arrays in Elasticsearch: Best Practices and Advanced Techniques

In this article, we will discuss advanced techniques and best practices for working with arrays in Elasticsearch. Mapping an array can...

Harnessing the Power of JSON in Elasticsearch

This article will delve into the advanced usage of JSON in Elasticsearch, focusing on its role in document structure, search queries...

How Does Elasticsearch Handle Uppercase vs. Lowercase Letters in the Aggs Order?

Elasticsearch sorts uppercase & lowercase letters separately, impacting results. Normalizers can be used to provide...

How to Aggregate Multiple Events in Logstash

This guide will show you how to leverage Logstash to aggregate multiple events that share some common data.

How to configure all Elasticsearch node roles (master, data, coordinating..)

Follow these steps to configure all Elasticsearch node role types (master, data, coordinating, ingest, machine learning, remote eligible...

How to Define Efficient Mapping in Elasticsearch

Mappings are the core element of index creation in Elasticsearch. Defining them correctly can improve performance. Mapping types include...

How to leverage ingest pipelines to transform data transparently in Elasticsearch

Ingest pipelines sit within the Elasticsearch node and will perform a set of alterations on your data that you...

How to Model Relationships Between Documents in Elasticsearch Using Join

The join data type field allows users to establish parent-child relationships between documents in Elasticsearch. To use it, you need to...

How to Model Relationships Between Documents in Elasticsearch Using Nesting

Elasticsearch has many methods for defining relationships between documents, such as nested documents. To use the nested field type...

How to Model Relationships Between Documents in Elasticsearch Using Object

Object types in Elasticsearch can be used to define relationships between documents. Here's how to use the object field type for that purpose.

How to Reduce the Number of Shards in an Elasticsearch Cluster

When you have too many shards in your Elasticsearch cluster, there are a few steps you can take in order to reduce the number of shards...

Index Lifecycle Management & Policy

Index lifecycle management helps automate the creation, management & removal of an Elasticsearch index. Define the index lifecycle policy...

Index Templating in Elasticsearch - How to Use Composable Templates

Elasticsearch index templates allow us to create indices with user defined configuration. An index can pull the configuration from these...

Leveraging Elasticsearch Script Fields: A Comprehensive Guide with Examples

This article will delve into the practical applications of script fields in Elasticsearch, offering examples to illustrate their usage.

Mastering Dynamic Templates in Elasticsearch

Dynamic templates in Elasticsearch allows you to define custom mappings that can be applied to dynamically added fields. To create...

Mastering Elasticsearch Custom Analyzers for Enhanced Search Capabilities

An Elasticsearch custom analyzer is defined by combining a single tokenizer with 0 or more token filters and character filters. To create...

Mastering Elasticsearch Normalizers for Improved Text Analysis

Elasticsearch normalizers are a crucial component in the text analysis process. Here is how they work, the best practices for usage...

Mastering GeoPoint Data Types in Elasticsearch

This guide delves into the intricacies of the Elasticsearch geo_point data type, its use cases & implementation and covers geospatial queries.

Mastering Timestamps in Elasticsearch

This article delves into the advanced aspects of handling timestamps in Elasticsearch, including indexing, querying, and formatting.

Object Fields VS. Nested Field Types in Elasticsearch

Nested is a special object type that is indexed as a separate document. To demonstrate the use of Elasticsearch nested VS. object fields...

Optimizing Schema Design in Elasticsearch: Techniques & Best Practices

In this article, we will discuss some advanced techniques and best practices for optimizing schema design in Elasticsearch.

SLM in Elasticsearch vs Snapshot Management in OpenSearch

Here are the similarities and differences between Elasticsearch Snapshot Lifecycle Management (SLM) and OpenSearch Snapshot Management (SM).

When You Should Transform Your Data Instead of Using Aggregations

There are at least three use cases where you should consider using transforms instead of aggregations in Elasticsearch. First, when the...

g2crowd

Loved by the Community ❤️

Our products are loved by over 12,000 users from the community. Thank you to our customers from all around the world!
soc2-type2