Posted to dev@parquet.apache.org by "Alex Levenson (JIRA)" <ji...@apache.org> on 2015/06/17 18:19:00 UTC

[jira] [Resolved] (PARQUET-244) DeltaByteArrayReader fails with ArrayIndexOutOfBoundsException when moving across pages

     [ https://issues.apache.org/jira/browse/PARQUET-244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alex Levenson resolved PARQUET-244.
-----------------------------------
    Resolution: Duplicate

> DeltaByteArrayReader fails with ArrayIndexOutOfBoundsException when moving across pages
> ---------------------------------------------------------------------------------------
>
>                 Key: PARQUET-244
>                 URL: https://issues.apache.org/jira/browse/PARQUET-244
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-mr
>    Affects Versions: 1.6.0
>            Reporter: Alosh Bennett
>
> DeltaByteArrayReader.readBytes() fails with ArrayIndexOutOfBoundsException soon after it has processed a new page via initFromPage(). The issue can be reproduced by reading a Binary column that is encoded with the delta byte array encoding (DELTA_BYTE_ARRAY) and spans multiple pages.
> This happens because ColumnReaderImpl.initDataReader() creates a new ValuesReader every time a new page is processed (see _this.dataColumn = dataEncoding.getValuesReader(path, VALUES)_). DeltaByteArrayReader is stateful: it needs to remember the _previous_ Binary value that was read, including across page boundaries. When a new DeltaByteArrayReader is created for the next page, this information is lost.
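
The failure mode described above can be illustrated with a minimal, self-contained sketch. It is not the parquet-mr code: the class DeltaSketch, its nested Entry and Reader types, and the sample data are all hypothetical, and the encoding is reduced to a (prefix length, suffix) pair per value. It assumes, as the report implies, that the first entry of a page may refer to the last value of the preceding page.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical, simplified model of delta (prefix) byte-array decoding.
// None of these names come from parquet-mr; they only illustrate the reported failure.
class DeltaSketch {

    /** One encoded entry: number of bytes shared with the previous value, plus the new suffix. */
    static final class Entry {
        final int prefixLength;
        final byte[] suffix;

        Entry(int prefixLength, String suffix) {
            this.prefixLength = prefixLength;
            this.suffix = suffix.getBytes(StandardCharsets.UTF_8);
        }
    }

    /** Stateful reader: every value is rebuilt from the previously decoded value. */
    static final class Reader {
        private byte[] previous = new byte[0];

        String read(Entry e) {
            byte[] out = new byte[e.prefixLength + e.suffix.length];
            for (int i = 0; i < e.prefixLength; i++) {
                // Throws ArrayIndexOutOfBoundsException when 'previous' is shorter than the prefix.
                out[i] = previous[i];
            }
            System.arraycopy(e.suffix, 0, out, e.prefixLength, e.suffix.length);
            previous = out;
            return new String(out, StandardCharsets.UTF_8);
        }
    }

    // Two "pages" of one column. The entry in PAGE_2 refers to the last value of PAGE_1.
    static final Entry[] PAGE_1 = { new Entry(0, "apple"), new Entry(3, "ly") };
    static final Entry[] PAGE_2 = { new Entry(4, "es") };

    /** Reads both pages with one reader, so the previous value survives the page boundary. */
    static String lastValueWithSharedReader() {
        Reader reader = new Reader();
        String last = null;
        for (Entry e : PAGE_1) last = reader.read(e);
        for (Entry e : PAGE_2) last = reader.read(e);
        return last;
    }

    /** Reads the second page with a fresh reader, mimicking one reader instance per page. */
    static boolean freshReaderFails() {
        try {
            new Reader().read(PAGE_2[0]);
            return false;
        } catch (ArrayIndexOutOfBoundsException expected) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("shared reader, last value: " + lastValueWithSharedReader());
        System.out.println("fresh reader fails: " + freshReaderFails());
    }
}
```

In this sketch the shared reader decodes the second page correctly because it still holds the last value of the first page, while the fresh reader starts with an empty previous value and fails as soon as an entry asks for a non-empty prefix.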



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)