Posted to dev@parquet.apache.org by GitBox <gi...@apache.org> on 2020/06/28 10:35:18 UTC
[GitHub] [parquet-mr] samarthjain commented on pull request #671: PARQUET-1643 Use airlift codecs for LZ4, LZ0, GZIP
samarthjain commented on pull request #671:
URL: https://github.com/apache/parquet-mr/pull/671#issuecomment-650731696
Force-pushed a new commit that makes the use of Airlift-based compressors configurable. Also added tests and GZIP benchmarks for the Airlift compressors. The benchmark results show no performance improvement or regression with Airlift GZIP compared to plain GZIP: every difference between the two is well within the reported error margins.
```
Benchmark                                                             Cnt   Score    Error
PageChecksumReadBenchmarks.read10MRowsAirliftGzipWithVerification       3   6.431 ± 0.741
PageChecksumReadBenchmarks.read10MRowsAirliftGzipWithoutVerification    3   6.605 ± 0.709
PageChecksumReadBenchmarks.read10MRowsGzipWithVerification              3   6.468 ± 0.700
PageChecksumReadBenchmarks.read10MRowsGzipWithoutVerification           3   6.583 ± 1.538
PageChecksumWriteBenchmarks.write10MRowsAirliftGzipWithChecksums        3  36.333 ± 0.510
PageChecksumWriteBenchmarks.write10MRowsAirliftGzipWithoutChecksums     3  36.069 ± 1.096
PageChecksumWriteBenchmarks.write10MRowsGzipWithChecksums               3  36.141 ± 1.095
PageChecksumWriteBenchmarks.write10MRowsGzipWithoutChecksums            3  36.174 ± 5.125
ReadBenchmarks.read1MRowsDefaultBlockAndPageSizeAirliftGZIP             3   0.898 ± 1.254
ReadBenchmarks.read1MRowsDefaultBlockAndPageSizeGZIP                    3   0.891 ± 1.201
```
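For anyone who wants to sanity-check the "no improvement or regression" reading, here is a minimal sketch that tests whether each Airlift/plain GZIP pair has overlapping score ± error intervals. The class and method names are hypothetical and not part of the PR, and it assumes the last two columns are the score and its error margin. Overlapping intervals are only a rough indication, not a formal significance test.

```java
// Hypothetical helper, not part of parquet-mr: a rough interval-overlap check.
final class BenchmarkOverlap {

    /** True when [a - errA, a + errA] and [b - errB, b + errB] intersect. */
    static boolean overlaps(double a, double errA, double b, double errB) {
        return Math.abs(a - b) <= errA + errB;
    }

    public static void main(String[] args) {
        // Each row: Airlift score, Airlift error, plain GZIP score, plain GZIP error,
        // copied from the benchmark results in this comment.
        double[][] pairs = {
            {6.431, 0.741, 6.468, 0.700},   // read10MRows, with verification
            {6.605, 0.709, 6.583, 1.538},   // read10MRows, without verification
            {36.333, 0.510, 36.141, 1.095}, // write10MRows, with checksums
            {36.069, 1.096, 36.174, 5.125}, // write10MRows, without checksums
            {0.898, 1.254, 0.891, 1.201},   // read1MRows, default block and page size
        };
        for (double[] p : pairs) {
            System.out.printf("%.3f vs %.3f overlap: %b%n",
                    p[0], p[2], overlaps(p[0], p[1], p[2], p[3]));
        }
    }
}
```

All five pairs overlap under this check, which is consistent with the conclusion above. With only 3 samples per benchmark the error margins are wide, so small real differences could still go undetected.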