You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Gabor Szadovszky (JIRA)" <ji...@apache.org> on 2018/12/13 16:02:00 UTC

[jira] [Resolved] (PARQUET-1472) Dictionary filter fails on FIXED_LEN_BYTE_ARRAY

     [ https://issues.apache.org/jira/browse/PARQUET-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gabor Szadovszky resolved PARQUET-1472.
---------------------------------------
       Resolution: Fixed
    Fix Version/s: 1.11.0

> Dictionary filter fails on FIXED_LEN_BYTE_ARRAY
> -----------------------------------------------
>
>                 Key: PARQUET-1472
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1472
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-mr
>            Reporter: Gabor Szadovszky
>            Assignee: Gabor Szadovszky
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.11.0
>
>
> DictonaryFilter does not handle FIXED_LEN_BYTE_ARRAY. Moreover, [DictionaryFilter.expandFilter(ColumnChunkMetaData)|https://github.com/apache/parquet-mr/blob/dc61e510126aaa1a95a46fe39bf1529f394147e9/parquet-hadoop/src/main/java/org/apache/parquet/filter2/dictionarylevel/DictionaryFilter.java#L78] returns an empty map instead of null therefore the row-group might be dropped as the value seems to not being in the dictionary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)