You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Gabor Szadovszky (Jira)" <ji...@apache.org> on 2019/09/24 10:27:00 UTC

[jira] [Updated] (PARQUET-1644) Clean up some benchmark code and docs.

     [ https://issues.apache.org/jira/browse/PARQUET-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gabor Szadovszky updated PARQUET-1644:
--------------------------------------
    Summary: Clean up some benchmark code and docs.  (was: Benchmark module needs some attention.)

> Clean up some benchmark code and docs.
> --------------------------------------
>
>                 Key: PARQUET-1644
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1644
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-mr
>            Reporter: Ryan Skraba
>            Assignee: Ryan Skraba
>            Priority: Minor
>              Labels: pull-request-available
>
> Strictly following the instructions on the [parquet-benchmarks|https://github.com/apache/parquet-mr/tree/fcc5d1a5a669570de3daeafd3f3b7788aa618536/parquet-benchmarks] module doesn't give meaningful results.
> It appears some new benchmarks enter into conflict with the globs specified for others, not all benchmarks are run, and some iterations of write benchmarks aren't evalulated due to unexpected "file already exists ..." fail-fast returns in the data generator.
> This should be cleaned up to encourage using and implementing benchmarks on Parquet code.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)