You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Natang (Jira)" <ji...@apache.org> on 2020/07/10 17:07:00 UTC

[jira] [Commented] (PARQUET-1580) Page-level CRC checksum verification for DataPageV1

    [ https://issues.apache.org/jira/browse/PARQUET-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17155616#comment-17155616 ] 

Natang commented on PARQUET-1580:
---------------------------------

Can this be backported to 1.10.1?

> Page-level CRC checksum verification for DataPageV1
> ---------------------------------------------------
>
>                 Key: PARQUET-1580
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1580
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>            Reporter: Boudewijn Braams
>            Assignee: Boudewijn Braams
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.11.0
>
>
> Now that the specification with respect to page level checksums has been clarified (see https://jira.apache.org/jira/browse/PARQUET-1539), we can actually implement it. What needs to be done:
> * Implement calculating and writing checksums on write path
> * Implement checksum validation on read path
> * Make writing out checksums and validation on reads optional (default off)
> * Implement tests



--
This message was sent by Atlassian Jira
(v8.3.4#803005)