You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Boudewijn Braams (JIRA)" <ji...@apache.org> on 2019/07/19 16:45:00 UTC

[jira] [Created] (PARQUET-1629) Page-level CRC checksum verification for DataPageV1

Boudewijn Braams created PARQUET-1629:
-----------------------------------------

             Summary: Page-level CRC checksum verification for DataPageV1
                 Key: PARQUET-1629
                 URL: https://issues.apache.org/jira/browse/PARQUET-1629
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-mr
            Reporter: Boudewijn Braams


In https://jira.apache.org/jira/browse/PARQUET-1580 (Github PR: https://github.com/apache/parquet-mr/pull/647) we implemented page level CRC checksum verification for DataPageV1. As a follow up, we should add support for DataPageV2 that follows the spec (see see https://jira.apache.org/jira/browse/PARQUET-1539).

What needs to be done:
* Add writing out checksums for DataPageV2
* Add checksum verification for DataPageV2
* Create new test suite
* Create new benchmarks



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)