Posted to issues@drill.apache.org by "James Turton (Jira)" <ji...@apache.org> on 2022/07/19 07:32:00 UTC

[jira] [Commented] (DRILL-7399) Querying parquet file with boolean data type return wrong results

    [ https://issues.apache.org/jira/browse/DRILL-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568389#comment-17568389 ] 

James Turton commented on DRILL-7399:
-------------------------------------

As a matter of interest, the transcript timings show that the old reader is ~6x faster for this query.
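
For anyone who wants to repeat the comparison, a minimal sketch follows. It assumes the two readers are switched with the `store.parquet.use_new_reader` session option and that the attachment is saved under /tmp as in the report below; neither detail is stated in this comment, so adjust both to your setup.

```sql
-- Assumption: this session option selects the newer Parquet reader.
ALTER SESSION SET `store.parquet.use_new_reader` = true;

SELECT cycle_id, press_run_1
FROM dfs.root.`/tmp/newrule22_3_1.parquet`
WHERE cycle_id = 23435119;

-- Switch back to the older reader and rerun the same query,
-- comparing both the returned press_run_1 value and the elapsed time.
ALTER SESSION SET `store.parquet.use_new_reader` = false;

SELECT cycle_id, press_run_1
FROM dfs.root.`/tmp/newrule22_3_1.parquet`
WHERE cycle_id = 23435119;
```

A single run per reader is only a rough indication; repeating each query a few times gives a fairer timing comparison.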

> Querying parquet file with boolean data type return wrong results
> -----------------------------------------------------------------
>
>                 Key: DRILL-7399
>                 URL: https://issues.apache.org/jira/browse/DRILL-7399
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>    Affects Versions: 1.16.0
>            Reporter: Fabian Barreiro
>            Assignee: James Turton
>            Priority: Critical
>             Fix For: 1.20.1
>
>         Attachments: newrule22_3_1.parquet
>
>
> The following query returns a wrong value for the boolean column press_run_1:
>  SELECT * FROM dfs.root.`/tmp/newrule22_3_1.parquet` WHERE cycle_id=23435119
> The query returns press_run_1 = 'false',
> but the Parquet file contains press_run_1 = 'true' for this record.
> Many other records show the same problem when different filters are used.
> Attached: newrule22_3_1.parquet


