You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "AllenL (Jira)" <ji...@apache.org> on 2020/07/15 06:47:00 UTC
[jira] [Updated] (LUCENE-9428) merge index failed with checksum
failed (hardware problem?)
[ https://issues.apache.org/jira/browse/LUCENE-9428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
AllenL updated LUCENE-9428:
---------------------------
Description:
Recently, a procedure using ElasticSearch appeared merge Index Failed with the following exception information
{code:java}
[2020-07-03 13:37:34,113][ERROR][index.engine ] [Deathbird] [st-sess][4] failed to merge[2020-07-03 13:37:34,113][ERROR][index.engine ] [Deathbird] [st-sess][4] failed to mergeorg.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=31f090d9 actual=d9697caa (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/var/lib/elasticsearch/17412c54-f974-11e9-9eef-80615f029e06/nodes/0/indices/st-sess/4/index/_3jm_Lucene50_0.tim"))) at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:334) at org.apache.lucene.codecs.CodecUtil.checksumEntireFile(CodecUtil.java:451) at org.apache.lucene.codecs.blocktree.BlockTreeTermsReader.checkIntegrity(BlockTreeTermsReader.java:333) at org.apache.lucene.codecs.perfield.PerFieldPostingsFormat$FieldsReader.checkIntegrity(PerFieldPostingsFormat.java:317) at org.apache.lucene.codecs.FieldsConsumer.merge(FieldsConsumer.java:96) at org.apache.lucene.index.SegmentMerger.mergeTerms(SegmentMerger.java:193) at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:95) at org.apache.lucene.index.IndexWriter.mergeMiddle(IndexWriter.java:4086) at org.apache.lucene.index.IndexWriter.merge(IndexWriter.java:3666) at org.apache.lucene.index.ConcurrentMergeScheduler.doMerge(ConcurrentMergeScheduler.java:588) at org.elasticsearch.index.engine.ElasticsearchConcurrentMergeScheduler.doMerge(ElasticsearchConcurrentMergeScheduler.java:94) at org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:626)[2020-07-03 13:37:34,203][WARN ][index.engine ] [Deathbird] [st-sess][4] failed engine [merge failed]org.apache.lucene.index.MergePolicy$MergeException: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=31f090d9 actual=d9697caa (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/var/lib/elasticsearch/shterm-17412c54-f974-11e9-9eef-80615f029e06/nodes/0/indices/st-sess/4/index/_3jm_Lucene50_0.tim"))) at org.elasticsearch.index.engine.InternalEngine$EngineMergeScheduler$1.doRun(InternalEngine.java:1237) at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748){code}
The exception shows that it may be a hardware problem. Try to check the hardware and find no exception. Check the command as follows:
# check device /dev/sda, /dev/sdb; but finds no hardware errors
using command: smartctl --xall /dev/sdx
# check message log /var/log/messages, no hardware problem happend
# The system has a state detection script, i get the system load recorded is normal, IOwait is very low
was:
Recently, a procedure using ElasticSearch appeared merge Index Failed with the following exception information
{code:java}
[2020-07-03 13:37:34,113][ERROR][index.engine ] [Deathbird] [st-sess][4] failed to merge[2020-07-03 13:37:34,113][ERROR][index.engine ] [Deathbird] [st-sess][4] failed to mergeorg.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=31f090d9 actual=d9697caa (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/var/lib/elasticsearch/17412c54-f974-11e9-9eef-80615f029e06/nodes/0/indices/st-sess/4/index/_3jm_Lucene50_0.tim"))) at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:334) at org.apache.lucene.codecs.CodecUtil.checksumEntireFile(CodecUtil.java:451) at org.apache.lucene.codecs.blocktree.BlockTreeTermsReader.checkIntegrity(BlockTreeTermsReader.java:333) at org.apache.lucene.codecs.perfield.PerFieldPostingsFormat$FieldsReader.checkIntegrity(PerFieldPostingsFormat.java:317) at org.apache.lucene.codecs.FieldsConsumer.merge(FieldsConsumer.java:96) at org.apache.lucene.index.SegmentMerger.mergeTerms(SegmentMerger.java:193) at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:95) at org.apache.lucene.index.IndexWriter.mergeMiddle(IndexWriter.java:4086) at org.apache.lucene.index.IndexWriter.merge(IndexWriter.java:3666) at org.apache.lucene.index.ConcurrentMergeScheduler.doMerge(ConcurrentMergeScheduler.java:588) at org.elasticsearch.index.engine.ElasticsearchConcurrentMergeScheduler.doMerge(ElasticsearchConcurrentMergeScheduler.java:94) at org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:626)[2020-07-03 13:37:34,203][WARN ][index.engine ] [Deathbird] [st-sess][4] failed engine [merge failed]org.apache.lucene.index.MergePolicy$MergeException: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=31f090d9 actual=d9697caa (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/var/lib/elasticsearch/17412c54-f974-11e9-9eef-80615f029e06/nodes/0/indices/st-sess/4/index/_3jm_Lucene50_0.tim"))) at org.elasticsearch.index.engine.InternalEngine$EngineMergeScheduler$1.doRun(InternalEngine.java:1237) at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748){code}
The exception shows that it may be a hardware problem. Try to check the hardware and find no exception. Check the command as follows:
# check device /dev/sda, /dev/sdb; but finds no hardware errors
using command: smartctl --xall /dev/sdx
# check message log /var/log/messages, no hardware problem happend
# The system has a state detection script, i get the system load recorded is normal, IOwait is very low
> merge index failed with checksum failed (hardware problem?)
> -----------------------------------------------------------
>
> Key: LUCENE-9428
> URL: https://issues.apache.org/jira/browse/LUCENE-9428
> Project: Lucene - Core
> Issue Type: Bug
> Environment: lucene version:5.5.4
> jdk version :jdk1.8-1.8.0_231-fcs
> Reporter: AllenL
> Priority: Major
>
> Recently, a procedure using ElasticSearch appeared merge Index Failed with the following exception information
>
> {code:java}
> [2020-07-03 13:37:34,113][ERROR][index.engine ] [Deathbird] [st-sess][4] failed to merge[2020-07-03 13:37:34,113][ERROR][index.engine ] [Deathbird] [st-sess][4] failed to mergeorg.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=31f090d9 actual=d9697caa (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/var/lib/elasticsearch/17412c54-f974-11e9-9eef-80615f029e06/nodes/0/indices/st-sess/4/index/_3jm_Lucene50_0.tim"))) at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:334) at org.apache.lucene.codecs.CodecUtil.checksumEntireFile(CodecUtil.java:451) at org.apache.lucene.codecs.blocktree.BlockTreeTermsReader.checkIntegrity(BlockTreeTermsReader.java:333) at org.apache.lucene.codecs.perfield.PerFieldPostingsFormat$FieldsReader.checkIntegrity(PerFieldPostingsFormat.java:317) at org.apache.lucene.codecs.FieldsConsumer.merge(FieldsConsumer.java:96) at org.apache.lucene.index.SegmentMerger.mergeTerms(SegmentMerger.java:193) at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:95) at org.apache.lucene.index.IndexWriter.mergeMiddle(IndexWriter.java:4086) at org.apache.lucene.index.IndexWriter.merge(IndexWriter.java:3666) at org.apache.lucene.index.ConcurrentMergeScheduler.doMerge(ConcurrentMergeScheduler.java:588) at org.elasticsearch.index.engine.ElasticsearchConcurrentMergeScheduler.doMerge(ElasticsearchConcurrentMergeScheduler.java:94) at org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:626)[2020-07-03 13:37:34,203][WARN ][index.engine ] [Deathbird] [st-sess][4] failed engine [merge failed]org.apache.lucene.index.MergePolicy$MergeException: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=31f090d9 actual=d9697caa (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/var/lib/elasticsearch/shterm-17412c54-f974-11e9-9eef-80615f029e06/nodes/0/indices/st-sess/4/index/_3jm_Lucene50_0.tim"))) at org.elasticsearch.index.engine.InternalEngine$EngineMergeScheduler$1.doRun(InternalEngine.java:1237) at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748){code}
>
> The exception shows that it may be a hardware problem. Try to check the hardware and find no exception. Check the command as follows:
> # check device /dev/sda, /dev/sdb; but finds no hardware errors
> using command: smartctl --xall /dev/sdx
> # check message log /var/log/messages, no hardware problem happend
> # The system has a state detection script, i get the system load recorded is normal, IOwait is very low
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org