Posted to dev@parquet.apache.org by "Ryan Skraba (Jira)" <ji...@apache.org> on 2019/08/29 13:47:00 UTC

[jira] [Created] (PARQUET-1644) Benchmark module needs some attention.

Ryan Skraba created PARQUET-1644:
------------------------------------

             Summary: Benchmark module needs some attention.
                 Key: PARQUET-1644
                 URL: https://issues.apache.org/jira/browse/PARQUET-1644
             Project: Parquet
          Issue Type: Bug
          Components: parquet-mr
            Reporter: Ryan Skraba


Strictly following the instructions for the [parquet-benchmarks|https://github.com/apache/parquet-mr/tree/fcc5d1a5a669570de3daeafd3f3b7788aa618536/parquet-benchmarks] module doesn't give meaningful results.

It appears that some newer benchmarks conflict with the globs specified for others, not all benchmarks are run, and some iterations of the write benchmarks aren't evaluated because the data generator fails fast with an unexpected "file already exists ..." error.
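
To illustrate the last point, here is a minimal sketch of one possible fix: removing leftover output before each write iteration so that the generator always does real work. This is not the module's actual code. The class name WriteIterationSketch and the methods generate and generateFresh are hypothetical stand-ins, and the fail-fast check is only an assumption about how the data generator behaves.

```java
import java.io.IOException;
import java.nio.file.FileAlreadyExistsException;
import java.nio.file.Files;
import java.nio.file.Path;

public class WriteIterationSketch {

    // Hypothetical stand-in for the data generator: it fails fast
    // when the target file is already present.
    static void generate(Path target) throws IOException {
        if (Files.exists(target)) {
            throw new FileAlreadyExistsException(target.toString());
        }
        Files.write(target, new byte[] {1, 2, 3});
    }

    // Delete any output left by a previous iteration, then generate,
    // so that every iteration performs a real write.
    static void generateFresh(Path target) throws IOException {
        Files.deleteIfExists(target);
        generate(target);
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("bench-sketch");
        Path target = dir.resolve("data.bin");
        // Three "iterations" writing to the same path; none is skipped.
        for (int i = 0; i < 3; i++) {
            generateFresh(target);
        }
        Files.delete(target);
        Files.delete(dir);
    }
}
```

In a JMH benchmark the cleanup would more likely live in a per-iteration setup or teardown hook than in the measured method, so that the delete is not counted in the timing.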

This should be cleaned up to encourage using and implementing benchmarks on Parquet code.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)