Posted to dev@parquet.apache.org by "Xinli Shang (Jira)" <ji...@apache.org> on 2020/04/09 15:58:00 UTC

[jira] [Commented] (PARQUET-1836) why the last chunk might be larger than descriptor.size?

    [ https://issues.apache.org/jira/browse/PARQUET-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17079512#comment-17079512 ] 

Xinli Shang commented on PARQUET-1836:
--------------------------------------

I searched the change history, and the fix comes from this change: https://github.com/apache/parquet-mr/commit/b297c73c1082728ad9626d17ce0f7abe6abaa36b which is in parquet-1.8.0rc1. That means files written by version 1.7 and below can be considered 'old files'.
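
On how to tell old files from new ones, here is a rough sketch that applies the 1.8 cutoff above to a writer string. It assumes the file footer's created_by value looks like "parquet-mr version 1.6.0 (build abc)". The class and method names are hypothetical and not part of parquet-mr, and the build id is a placeholder. It is not how parquet-mr itself decides when to apply the workaround.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of parquet-mr.
final class LegacyChunkCheck {
    // Assumed created_by shape: "parquet-mr version <major>.<minor>...".
    private static final Pattern VERSION =
        Pattern.compile("parquet-mr version (\\d{1,9})\\.(\\d{1,9})");

    /** Returns true when createdBy names a parquet-mr release before 1.8. */
    static boolean writtenBeforeFix(String createdBy) {
        if (createdBy == null) {
            return false; // unknown writer: this sketch makes no claim
        }
        Matcher m = VERSION.matcher(createdBy);
        if (!m.find()) {
            return false; // not a parquet-mr writer string we recognise
        }
        int major = Integer.parseInt(m.group(1));
        int minor = Integer.parseInt(m.group(2));
        // The fix is in 1.8.0rc1, so 1.7 and below count as "old".
        return major < 1 || (major == 1 && minor < 8);
    }

    public static void main(String[] args) {
        System.out.println(writtenBeforeFix("parquet-mr version 1.6.0 (build abc)"));
        System.out.println(writtenBeforeFix("parquet-mr version 1.10.1 (build abc)"));
    }
}
```

Files from other writers, or files with no created_by at all, return false here only because the sketch has nothing to go on, so a reader should not treat that result as proof the file is unaffected.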





> why the last chunk might be larger than descriptor.size?
> --------------------------------------------------------
>
>                 Key: PARQUET-1836
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1836
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>            Reporter: Zhenglin luo
>            Priority: Major
>
> I don't know why the last chunk might be larger than descriptor.size.
> I saw the code comment saying "It is for reading old files". So there is no problem with new files, is there?
> By the way, how can old and new files be distinguished?
>  
>  


