Posted to dev@parquet.apache.org by "Xinli Shang (Jira)" <ji...@apache.org> on 2020/04/09 15:58:00 UTC
[jira] [Commented] (PARQUET-1836) why the last chunk might be larger than descriptor.size?
[ https://issues.apache.org/jira/browse/PARQUET-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17079512#comment-17079512 ]
Xinli Shang commented on PARQUET-1836:
--------------------------------------
I searched the change history and found that the fix comes from this commit: https://github.com/apache/parquet-mr/commit/b297c73c1082728ad9626d17ce0f7abe6abaa36b, which is in parquet-1.8.0rc1. That means files written by version 1.7 and below can be considered 'old files'.
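To make the "1.7 and below" rule concrete, here is a minimal sketch of a version check on the writer string stored in the file footer. It assumes the string has the shape "parquet-mr version X.Y.Z (build <hash>)" and that you have already read it from the footer (in parquet-mr, presumably via FileMetaData.getCreatedBy()). The class name CreatedByCheck and the method isOldFile are hypothetical and not part of parquet-mr.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class CreatedByCheck {
    // Assumed created_by shape: "parquet-mr version 1.6.0 (build <hash>)".
    private static final Pattern VERSION =
        Pattern.compile("^parquet-mr version (\\d+)\\.(\\d+)");

    /**
     * Returns true when created_by names a parquet-mr release before 1.8,
     * i.e. a writer that predates the fix discussed above.
     * Returns false when the string is missing or does not match the
     * assumed shape, because the writer version cannot be determined.
     */
    public static boolean isOldFile(String createdBy) {
        if (createdBy == null) {
            return false;
        }
        Matcher m = VERSION.matcher(createdBy);
        if (!m.find()) {
            return false;
        }
        int major = Integer.parseInt(m.group(1));
        int minor = Integer.parseInt(m.group(2));
        // Only major.minor matters here; 1.8.0rc1 already contains the fix.
        return major < 1 || (major == 1 && minor < 8);
    }

    public static void main(String[] args) {
        System.out.println(isOldFile("parquet-mr version 1.6.0 (build abc123)"));
        System.out.println(isOldFile("parquet-mr version 1.8.0rc1 (build abc123)"));
    }
}
```

Treating an unparseable string as "not old" is only one possible choice; a reader that wants to be safe could treat unknown writers as old instead.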
> why the last chunk might be larger than descriptor.size?
> --------------------------------------------------------
>
> Key: PARQUET-1836
> URL: https://issues.apache.org/jira/browse/PARQUET-1836
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-mr
> Reporter: Zhenglin luo
> Priority: Major
>
> I don't know why the last chunk might be larger than descriptor.size.
> I saw the comment saying "It is for reading old files". So there is no problem with new files, is there?
> By the way, how can old and new files be distinguished?
>
>