You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Jan Soubusta (Jira)" <ji...@apache.org> on 2021/06/23 15:33:00 UTC
[jira] [Resolved] (DRILL-7797) Decimals are wrongly read from
parquet files
[ https://issues.apache.org/jira/browse/DRILL-7797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jan Soubusta resolved DRILL-7797.
---------------------------------
Resolution: Fixed
Don't know, what you did, but it is fixed, I verified it on TPCH data stored in Minio in PARQUET format, decimals are no longer corrupted
> Decimals are wrongly read from parquet files
> --------------------------------------------
>
> Key: DRILL-7797
> URL: https://issues.apache.org/jira/browse/DRILL-7797
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Parquet
> Affects Versions: 1.18.0
> Reporter: Jan Soubusta
> Priority: Major
>
> My setup:
> Docker embedded Drill 1.18 (latest).
> Parquet decimals are wrongly read by Drill, wrong huge / negative values are displayed.
> Example small public file:
> [https://gdc-tiger-test-data-eu-central.s3.eu-central-1.amazonaws.com/other_files/tpch/supplier.parquet/a51ab8fd-v_verticadb_node0001-140644911675136-0.parquet]
> The file was exported from Vertica database using EXPORT TO PARQUET statement.
> My colleague utilizes his parquet reader written in C++ and this is his comment:
> {quote}
> 6th column S_ACCTBAL has type FIXED_LEN_BYTE_ARRAY and convertedType DECIMAL with scale 2 and precision 15.
> I would say, that it is correctly conforming specification.
> {quote}
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)