You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Gopal Vijayaraghavan (Jira)" <ji...@apache.org> on 2020/02/13 15:58:00 UTC

[jira] [Assigned] (ORC-600) StringDictionaryColumnReader does not update index buffer correctly

     [ https://issues.apache.org/jira/browse/ORC-600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gopal Vijayaraghavan reassigned ORC-600:
----------------------------------------

    Assignee: Norbert Luksa

> StringDictionaryColumnReader does not update index buffer correctly
> -------------------------------------------------------------------
>
>                 Key: ORC-600
>                 URL: https://issues.apache.org/jira/browse/ORC-600
>             Project: ORC
>          Issue Type: Bug
>            Reporter: Norbert Luksa
>            Assignee: Norbert Luksa
>            Priority: Major
>              Labels: c++
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When we are scanning with StringDictionaryColumnReader an encoded batch, ORC crashes if we want to read more elements than the current capacity of the batch. It can occur eg. with string lists where it's possible that the offsets defined in the list overflows in the index buffer of the EncodedStringDictionaryBatch.
> To fix we only have to define a resize method for EncodedStringDictionaryBatch that also resizes the index buffer.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)