Posted to issues@drill.apache.org by "Nils Schlicher (Jira)" <ji...@apache.org> on 2020/03/25 20:27:00 UTC

[jira] [Commented] (DRILL-7399) Querying parquet file with boolean data type return wrong results

    [ https://issues.apache.org/jira/browse/DRILL-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17067072#comment-17067072 ] 

Nils Schlicher commented on DRILL-7399:
---------------------------------------

I have the same problem in 1.17.0 with my Parquet data.

 

The following query returns wrong results. Using the new reader solves the problem, but the new reader is much slower than the default one.


 {{SELECT COUNT(*) FROM data WHERE measurement_point_valid = false;}}
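
For anyone who wants to reproduce the comparison, a minimal sketch of the workaround follows. It assumes the "new reader" is the one toggled by the Drill session option `store.parquet.use_new_reader`; that option name is taken from the Drill documentation, not from this comment, so verify it against your version. The table `data` and the column `measurement_point_valid` are the ones from the query above.

```sql
-- Assumption: the new Parquet reader is enabled per session via this option.
ALTER SESSION SET `store.parquet.use_new_reader` = true;

-- Re-run the query; with the new reader the count is expected to be correct.
SELECT COUNT(*) FROM data WHERE measurement_point_valid = false;

-- Switch back to the default (faster) reader afterwards.
ALTER SESSION SET `store.parquet.use_new_reader` = false;
```

Running the same COUNT under both settings and comparing the two results is a quick way to check whether a given file is affected.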

> Querying parquet file with boolean data type return wrong results
> -----------------------------------------------------------------
>
>                 Key: DRILL-7399
>                 URL: https://issues.apache.org/jira/browse/DRILL-7399
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>    Affects Versions: 1.16.0
>            Reporter: Fabian Barreiro
>            Priority: Critical
>         Attachments: newrule22_3_1.parquet
>
>
> The following query returns a wrong value for the boolean column press_run_1:
>  SELECT * FROM dfs.root.`/tmp/newrule22_3_1.parquet` WHERE cycle_id=23435119
> The query returns press_run_1 = 'false',
> but the Parquet file contains press_run_1 = 'true' for this record.
> Many other records show the same problem if you try different selects.
> Attached: newrule22_3_1.parquet.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)