You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Martin Loncaric (Jira)" <ji...@apache.org> on 2022/05/26 17:06:00 UTC

[jira] [Updated] (ORC-1189) Benchmark Documentation Issues

     [ https://issues.apache.org/jira/browse/ORC-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martin Loncaric updated ORC-1189:
---------------------------------
    Summary: Benchmark Documentation Issues  (was: Benchmark Taxi Dataset, Stability, Documentation Issues)

> Benchmark Documentation Issues
> ------------------------------
>
>                 Key: ORC-1189
>                 URL: https://issues.apache.org/jira/browse/ORC-1189
>             Project: ORC
>          Issue Type: Bug
>            Reporter: Martin Loncaric
>            Assignee: Martin Loncaric
>            Priority: Minor
>             Fix For: 1.8.0, 1.7.5
>
>
> * Since 5/12, NYC Taxi dataset used in benchmarks no longer exists as CSV's; has been replaced with Parquet
> https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page
> bq. On 05/13/2022, we are making the following changes to trip record files: All files will be stored in the Parquet format. Please see the ‘Working With Parquet Format’ under the Data Dictionaries and MetaData section.
> * Running any benchmark fails with "java.util.ServiceConfigurationError" because one benchmark cannot be instantiated
> * Some documentation could be more helpful, e.g. generate command calling itself "convert" in help page



--
This message was sent by Atlassian Jira
(v8.20.7#820007)