You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "James Turton (Jira)" <ji...@apache.org> on 2022/07/19 07:32:00 UTC
[jira] [Commented] (DRILL-7399) Querying parquet file with boolean data type return wrong results
[ https://issues.apache.org/jira/browse/DRILL-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568389#comment-17568389 ]
James Turton commented on DRILL-7399:
-------------------------------------
As a matter of interest, the transcript timings show that the old reader is ~6x faster for this query.
> Querying parquet file with boolean data type return wrong results
> -----------------------------------------------------------------
>
> Key: DRILL-7399
> URL: https://issues.apache.org/jira/browse/DRILL-7399
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Parquet
> Affects Versions: 1.16.0
> Reporter: Fabian Barreiro
> Assignee: James Turton
> Priority: Critical
> Fix For: 1.20.1
>
> Attachments: newrule22_3_1.parquet
>
>
> The following query return a wrong value for the boolean column press_run_1:
> SELECT * FROM dfs.root.`/tmp/newrule22_3_1.parquet` WHERE cycle_id=23435119
> The query return press_run_1 = 'false'
> the parquet file contain pess_run_1 = 'true' value for this record.
> You can find many records with this problem if try different selects.
> ATTACHED: newrule22_3_1.parquet file.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)