You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Norbert Luksa (Jira)" <ji...@apache.org> on 2019/09/12 14:36:00 UTC

[jira] [Work started] (IMPALA-8498) Write column index for floating types when NaN is not present

     [ https://issues.apache.org/jira/browse/IMPALA-8498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on IMPALA-8498 started by Norbert Luksa.
---------------------------------------------
> Write column index for floating types when NaN is not present
> -------------------------------------------------------------
>
>                 Key: IMPALA-8498
>                 URL: https://issues.apache.org/jira/browse/IMPALA-8498
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Zoltán Borók-Nagy
>            Assignee: Norbert Luksa
>            Priority: Major
>              Labels: ramp-up
>
> IMPALA-7304 disabled column index writing for floating point columns until PARQUET-1222 is resolved.
> PARQUET-1222 is responsible for defining a total order for floating values, but the problematic values are only the NaNs. Therefore we can write the column index if NaNs are not present in the data. Parquet-MR also does this, following the principles in [https://github.com/apache/parquet-format/blob/75eb7a7b84e6e62bfb09668b6d8d40b12597456e/src/main/thrift/parquet.thrift#L827-L834] 
> Impala should follow this behavior, and also when storing zeroes, it should store -0.0 as minimum and +0.0 as maximum.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org