You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Adrien Grand (JIRA)" <ji...@apache.org> on 2013/10/09 10:10:50 UTC
[jira] [Resolved] (LUCENE-5267) java.lang.ArrayIndexOutOfBoundsException on reading data

     [ https://issues.apache.org/jira/browse/LUCENE-5267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Adrien Grand resolved LUCENE-5267.
----------------------------------

    Resolution: Not A Problem

bq. dOff - matchDec <0, so throws java.lang.ArrayIndexOutOfBoundsException
bq. dest.length=33288,dOff=3184,matchDec=34510,matchLen=15,fastLen=16

Indeed, all the lines you pasted make no sense since matchDec should be lower than dOff. To me this really looks like your index got corrupted somehow. It could be a single corrupt byte that makes LZ4 read a length on 2 bytes instead of 1 and this shift makes LZ4 try to decompress bytes that make no sense at all, explaining why all matchDecs are all higher than dOff.

There are likely only a few chunks that are broken so if you want to try to get back as many documents as possible from the corrupt segment, the following piece of code may help https://gist.github.com/jpountz/6461246

> java.lang.ArrayIndexOutOfBoundsException on reading data
> --------------------------------------------------------
>
>                 Key: LUCENE-5267
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5267
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/index
>    Affects Versions: 4.4
>            Reporter: Littlestar
>            Assignee: Adrien Grand
>              Labels: LZ4
>
> java.lang.ArrayIndexOutOfBoundsException
> 	at org.apache.lucene.codecs.compressing.LZ4.decompress(LZ4.java:132)
> 	at org.apache.lucene.codecs.compressing.CompressionMode$4.decompress(CompressionMode.java:135)
> 	at org.apache.lucene.codecs.compressing.CompressingStoredFieldsReader.visitDocument(CompressingStoredFieldsReader.java:336)
> 	at org.apache.lucene.index.SegmentReader.document(SegmentReader.java:133)
> 	at org.apache.lucene.index.BaseCompositeReader.document(BaseCompositeReader.java:110)
> 	at org.apache.lucene.index.SlowCompositeReaderWrapper.document(SlowCompositeReaderWrapper.java:212)
> 	at org.apache.lucene.index.FilterAtomicReader.document(FilterAtomicReader.java:365)
> 	at org.apache.lucene.index.BaseCompositeReader.document(BaseCompositeReader.java:110)
> 	at org.apache.lucene.index.IndexReader.document(IndexReader.java:447)
> 	at org.apache.lucene.search.IndexSearcher.doc(IndexSearcher.java:204)



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org