You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Rotem Levi (Jira)" <ji...@apache.org> on 2023/06/04 15:46:00 UTC

[jira] [Comment Edited] (PARQUET-2306) Parquet statistics stats_null_count is wrong for large json string colums

    [ https://issues.apache.org/jira/browse/PARQUET-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17729074#comment-17729074 ] 

Rotem Levi edited comment on PARQUET-2306 at 6/4/23 3:45 PM:
-------------------------------------------------------------

sure, i have added a parquet file to reproduce the issue.

[~wgtmac] 


was (Author: JIRAUSER300578):
sure, i have added a parquet file to reproduce the issue.

> Parquet statistics stats_null_count is wrong for large json string colums
> -------------------------------------------------------------------------
>
>                 Key: PARQUET-2306
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2306
>             Project: Parquet
>          Issue Type: Bug
>            Reporter: Rotem Levi
>            Priority: Major
>         Attachments: part-00000-7e5dcc44-d176-42ab-98b9-664cace5d433-c000.snappy.parquet
>
>
> stats_null_count always None when one of the columns values is about 46977 characters length   



--
This message was sent by Atlassian Jira
(v8.20.10#820010)