Posted to dev@parquet.apache.org by "Natang (Jira)" <ji...@apache.org> on 2020/07/10 17:07:00 UTC
[jira] [Commented] (PARQUET-1580) Page-level CRC checksum verification for DataPageV1
[ https://issues.apache.org/jira/browse/PARQUET-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17155616#comment-17155616 ]
Natang commented on PARQUET-1580:
---------------------------------
Can this be backported to 1.10.1?
> Page-level CRC checksum verification for DataPageV1
> ---------------------------------------------------
>
> Key: PARQUET-1580
> URL: https://issues.apache.org/jira/browse/PARQUET-1580
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-mr
> Reporter: Boudewijn Braams
> Assignee: Boudewijn Braams
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.11.0
>
>
> Now that the specification for page-level checksums has been clarified (see https://jira.apache.org/jira/browse/PARQUET-1539), we can implement it. What needs to be done:
> * Implement calculating and writing checksums on the write path
> * Implement checksum validation on the read path
> * Make both writing checksums and validating them on reads optional (default off)
> * Implement tests
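For illustration, the write-path and read-path steps listed above can be sketched as follows. This is a minimal sketch, not the parquet-mr implementation: it assumes the checksum is a standard CRC32 (here java.util.zip.CRC32) computed over the page bytes as they are written to disk, and the class name, method names, and the verificationEnabled flag are all hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.CRC32;

// Hypothetical sketch; none of these names come from parquet-mr.
final class PageChecksumSketch {

    // Write path: compute a CRC32 over the page bytes.
    // The 32-bit result is narrowed to int (assumption: it is stored
    // in a signed 32-bit field), so the value may be negative.
    static int computeCrc(byte[] pageBytes) {
        CRC32 crc = new CRC32();
        crc.update(pageBytes, 0, pageBytes.length);
        return (int) crc.getValue();
    }

    // Read path: recompute the CRC and compare it with the stored one.
    // When verification is disabled (the proposed default), the page is
    // accepted without any check.
    static boolean verifyCrc(byte[] pageBytes, int storedCrc, boolean verificationEnabled) {
        if (!verificationEnabled) {
            return true;
        }
        return computeCrc(pageBytes) == storedCrc;
    }

    public static void main(String[] args) {
        byte[] page = "example page bytes".getBytes(StandardCharsets.UTF_8);
        int stored = computeCrc(page);

        // Simulate corruption by flipping one bit of the first byte.
        byte[] corrupted = page.clone();
        corrupted[0] ^= 0x01;

        System.out.println("intact page verifies:    " + verifyCrc(page, stored, true));
        System.out.println("corrupted page verifies: " + verifyCrc(corrupted, stored, true));
    }
}
```

Keeping verification behind a flag mirrors the "optional (default off)" item: readers that do not opt in pay no extra cost, while those that do can reject a page whose recomputed checksum does not match the stored one.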
--
This message was sent by Atlassian Jira
(v8.3.4#803005)