Improve your Elasticsearch Reindex Performance with these Tips

Improve your Elasticsearch Reindex Performance with these Tips

Last Updated : August 2020

Learn how to reindex Elasticsearch more efficiently and improve Elasticsearch reindexing performance by following these tips: 

  1. Disable Replicas

    Disable replicas when building a new index from scratch that is not serving the search traffic. Replicas can be changed dynamically later on once re-indexing has been completed.

  2. Disable Refresh Interval

    Disable refresh interval again. It can be changed once re-indexing has been completed.

  3. Use Bulk API

    Use the bulk API with multiple clients to get the maximum throughput from Elasticsearch (Benchmark Elasticsearch cluster to avoid any performance issues).

  4. Increase Buffer Size

    Increase index buffer size and use opster’s detailed documentation to fine-tune it.

  5. Use Reindex API

    If _source field is enabled and you are re-indexing in the case of changing analyzer on the existing fields (breaking changes), use Reindex API of elasticsearch.

  6. Disable Merge Throttling

    Disable merge throttling by changing the setting `indices.store.throttle.type` to none. If you have a massive write-heavy index, then you can make it permanent.

  7. Ensure Optimal Scalability Settings

    Choosing the optimal number of primary shards is crucial for scalability, which can’t be changed later on. Refer to opster’s guide to shards and replicas to understand more. Also, make sure you don’t end up creating “hotspots” in the cluster.

To get personalized detailed recommendations that can improve indexing and search performance run our Elasticsearch Check-Up (no installation required)!