Health check failed – How to solve this OpenSearch error

Opster Team

Aug-23, Version: 1-2.9


Briefly, this error occurs when OpenSearch cannot successfully complete a health check on a cluster or node. This could be due to network issues, high load, or configuration problems. To resolve it, check the network connectivity between nodes, ensure there are enough system resources (CPU, memory, disk space), and verify the configuration settings. If the issue persists, check the OpenSearch logs for more detailed error information. Restarting the affected nodes or the entire cluster may also help.
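As a starting point for these checks, the cluster health API (`GET _cluster/health`) reports the overall cluster status. The following is a minimal Java sketch, not a definitive implementation. It assumes a node reachable at `http://localhost:9200` without authentication or TLS, and the class and method names (`ClusterHealthProbe`, `extractStatus`, `fetchHealth`) are illustrative only. The status is pulled out with a simple regular expression to keep the example dependency-free; a real client would use a JSON parser.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ClusterHealthProbe {
    // Matches the "status" field of a _cluster/health response body.
    private static final Pattern STATUS =
        Pattern.compile("\"status\"\\s*:\\s*\"(green|yellow|red)\"");

    // Returns the cluster status if the body contains one, otherwise empty.
    public static Optional<String> extractStatus(String body) {
        Matcher m = STATUS.matcher(body);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    // Calls GET <baseUrl>/_cluster/health and returns the raw response body.
    public static String fetchHealth(String baseUrl) throws Exception {
        HttpClient client = HttpClient.newBuilder()
            .connectTimeout(Duration.ofSeconds(5))
            .build();
        HttpRequest request = HttpRequest.newBuilder(URI.create(baseUrl + "/_cluster/health"))
            .timeout(Duration.ofSeconds(10))
            .GET()
            .build();
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    public static void main(String[] args) throws Exception {
        // Assumed default address; pass a different base URL as the first argument.
        String baseUrl = args.length > 0 ? args[0] : "http://localhost:9200";
        String body = fetchHealth(baseUrl);
        System.out.println("status: " + extractStatus(body).orElse("unknown"));
        System.out.println(body);
    }
}
```

A `red` or `yellow` status, or a request that times out, indicates which of the checks above (connectivity, resources, configuration) to look at first.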

For a complete solution to your search operation, try AutoOps for Elasticsearch & OpenSearch for free. With AutoOps and Opster’s proactive support, you don’t have to worry about your search operation – we take charge of it. Get improved performance & stability with less hardware.

This guide will help you check for common problems that cause the log “{} health check failed” to appear. To understand the issues related to this log, read the explanation below about the following OpenSearch concept: cluster.

Log Context

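The log “{} health check failed” is emitted by the class in FollowersChecker.java. As the excerpt shows, it is logged when a follower check fails and the cause of the failure is a NodeHealthCheckFailureException.
We extracted the following from the OpenSearch source code for those seeking in-depth context: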

                        final String reason;
                        if (exp instanceof ConnectTransportException || exp.getCause() instanceof ConnectTransportException) {
                            logger.info(() -> new ParameterizedMessage("{} disconnected", FollowerChecker.this), exp);
                            reason = "disconnected";
                        } else if (exp.getCause() instanceof NodeHealthCheckFailureException) {
                            logger.info(() -> new ParameterizedMessage("{} health check failed", FollowerChecker.this), exp);
                            reason = "health check failed";
                        } else if (failureCountSinceLastSuccess >= followerCheckRetryCount) {
                            logger.info(() -> new ParameterizedMessage("{} failed too many times", FollowerChecker.this), exp);
                            reason = "followers check retry count exceeded";
                        } else {
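
To make the branch order in the excerpt easier to follow, here is a self-contained Java sketch of the same classification logic. It is an illustration only, not the OpenSearch implementation: the two exception classes are simplified stand-ins for the real OpenSearch types, and `FollowerFailureReasonSketch` and `classify` are hypothetical names. The final `else` branch is cut off in the excerpt, so the sketch returns an empty result there rather than guessing what it does.

```java
import java.util.Optional;

public class FollowerFailureReasonSketch {
    // Simplified stand-ins for the OpenSearch exception types named in the excerpt.
    public static class ConnectTransportException extends RuntimeException {
        public ConnectTransportException(String msg) { super(msg); }
    }

    public static class NodeHealthCheckFailureException extends RuntimeException {
        public NodeHealthCheckFailureException(String msg) { super(msg); }
    }

    // Mirrors the if/else-if chain from the excerpt and returns the chosen reason.
    public static Optional<String> classify(Exception exp,
                                            int failureCountSinceLastSuccess,
                                            int followerCheckRetryCount) {
        Throwable cause = exp.getCause();
        if (exp instanceof ConnectTransportException || cause instanceof ConnectTransportException) {
            return Optional.of("disconnected");
        } else if (cause instanceof NodeHealthCheckFailureException) {
            // This is the branch that logs "{} health check failed".
            return Optional.of("health check failed");
        } else if (failureCountSinceLastSuccess >= followerCheckRetryCount) {
            return Optional.of("followers check retry count exceeded");
        }
        // The excerpt ends at the final else; its body is not reproduced here.
        return Optional.empty();
    }

    public static void main(String[] args) {
        Exception wrapped = new RuntimeException(
            "remote failure", new NodeHealthCheckFailureException("node unhealthy"));
        // Only one failure so far, retry limit of three (both values are arbitrary).
        System.out.println(classify(wrapped, 1, 3)); // prints Optional[health check failed]
    }
}
```

Note that the health check branch is evaluated before the retry count, so in this chain a wrapped NodeHealthCheckFailureException yields "health check failed" regardless of how many failures have accumulated.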

 
