Error while reading global checkpoint from translog – How to solve this OpenSearch error

Opster Team

Aug-23, Version: 1-1.1


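Briefly, this warning is logged when OpenSearch fails to read the global checkpoint from the transaction log (translog) of a shard copy that is about to start a peer recovery. Possible causes are a corrupted translog, disk I/O problems, or a bug in OpenSearch. As the source excerpt below shows, the node does not abort at this point: it resets the starting sequence number to unassigned and recovers the shard as if it had no local files. If the warning keeps recurring, you can try the following: 1) Restart the OpenSearch node. 2) Check the disk for errors and fix any that you find. 3) If the translog is corrupted, you may need to delete it and let the shard recover without it; be aware that operations which were in the translog but not yet committed to the index can be lost. 4) Upgrade OpenSearch to the latest version to pick up any relevant bug fixes.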
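
Before restarting nodes or touching the translog, it can help to see what state the affected shard recoveries are in. The following is a minimal sketch, not an official client: it assumes an unsecured cluster reachable at http://localhost:9200 and a hypothetical index name my-index (both are placeholders, and the class name RecoveryCheck is made up for illustration). It calls the _cat/recovery API with Java's built-in HTTP client (Java 11 or later) and prints the raw response.

```java
import java.io.IOException;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

class RecoveryCheck {

    // Builds the _cat/recovery URI for one index.
    // baseUrl and index are supplied by the caller; nothing here is cluster-specific.
    static URI recoveryUri(String baseUrl, String index) {
        // Drop a trailing slash so the path is not doubled.
        String base = baseUrl.endsWith("/") ? baseUrl.substring(0, baseUrl.length() - 1) : baseUrl;
        String encodedIndex = URLEncoder.encode(index, StandardCharsets.UTF_8);
        // v=true asks the cat API to include a header row.
        return URI.create(base + "/_cat/recovery/" + encodedIndex + "?v=true");
    }

    public static void main(String[] args) {
        // Assumed defaults: local unsecured cluster and a placeholder index name.
        String baseUrl = args.length > 0 ? args[0] : "http://localhost:9200";
        String index = args.length > 1 ? args[1] : "my-index";

        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(recoveryUri(baseUrl, index)).GET().build();
        try {
            HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
            System.out.println("HTTP " + response.statusCode());
            System.out.println(response.body());
        } catch (IOException e) {
            System.err.println("Could not reach the cluster: " + e.getMessage());
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }
}
```

The response lists one row per shard recovery, including its stage and type, which helps to tell whether the shard that logged the warning eventually finished recovering. A secured cluster would additionally need TLS and authentication, which this sketch leaves out.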


This guide will help you check for common problems that cause the log “error while reading global checkpoint from translog;” to appear. To understand the issues related to this log, read the explanation below about the following OpenSearch concepts: indices, recovery.

Log Context

The log “error while reading global checkpoint from translog;” is emitted by the class PeerRecoveryTargetService.java.
For in-depth context, the following excerpt is taken from the OpenSearch source code:

            try {
                final String expectedTranslogUUID = metadataSnapshot.getCommitUserData().get(Translog.TRANSLOG_UUID_KEY);
                final long globalCheckpoint = Translog.readGlobalCheckpoint(recoveryTarget.translogLocation(), expectedTranslogUUID);
                assert globalCheckpoint + 1 >= startingSeqNo : "invalid startingSeqNo " + startingSeqNo + " >= " + globalCheckpoint;
            } catch (IOException | TranslogCorruptedException e) {
                logger.warn(new ParameterizedMessage("error while reading global checkpoint from translog; " +
                    "resetting the starting sequence number from {} to unassigned and recovering as if there are none", startingSeqNo), e);
                metadataSnapshot = Store.MetadataSnapshot.EMPTY;
                startingSeqNo = UNASSIGNED_SEQ_NO;
            }
        } catch (final org.apache.lucene.index.IndexNotFoundException e) {
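
The fallback in the excerpt above can be modelled in isolation. The sketch below is a simplified stand-in, not OpenSearch code: the class StartingSeqNoFallback and the interface CheckpointReader are hypothetical names introduced for illustration, the reader replaces the call to Translog.readGlobalCheckpoint, and only IOException is modelled (the real code also catches TranslogCorruptedException). The constant -2 is assumed to mirror the value of UNASSIGNED_SEQ_NO in OpenSearch.

```java
import java.io.IOException;

class StartingSeqNoFallback {

    // Assumed to mirror UNASSIGNED_SEQ_NO in OpenSearch.
    static final long UNASSIGNED_SEQ_NO = -2L;

    // Hypothetical stand-in for Translog.readGlobalCheckpoint(location, translogUUID).
    interface CheckpointReader {
        long readGlobalCheckpoint() throws IOException;
    }

    // Returns the sequence number the recovery would start from:
    // the requested one if the checkpoint is readable, otherwise "unassigned".
    static long resolveStartingSeqNo(long startingSeqNo, CheckpointReader reader) {
        try {
            long globalCheckpoint = reader.readGlobalCheckpoint();
            // Same sanity check as the excerpt; only active when assertions are enabled.
            assert globalCheckpoint + 1 >= startingSeqNo : "invalid startingSeqNo " + startingSeqNo;
            return startingSeqNo;
        } catch (IOException e) {
            // This is the branch that produces the warning discussed in this guide.
            System.err.println("error while reading global checkpoint from translog; resetting from "
                + startingSeqNo + " to unassigned (" + e.getMessage() + ")");
            return UNASSIGNED_SEQ_NO;
        }
    }

    public static void main(String[] args) {
        // Readable translog: the requested starting point is kept.
        System.out.println(resolveStartingSeqNo(42L, () -> 41L)); // prints 42
        // Unreadable translog: the starting point is reset to unassigned.
        System.out.println(resolveStartingSeqNo(42L, () -> {
            throw new IOException("simulated corrupt translog");
        })); // prints -2
    }
}
```

In the real class the same branch also replaces the metadata snapshot with Store.MetadataSnapshot.EMPTY, so the recovery proceeds as if the target shard copy held no files. That is why the warning by itself does not necessarily mean the recovery failed.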

 
