Posted to jira@arrow.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/04/22 17:57:00 UTC

[jira] [Updated] (ARROW-12500) [C++][Dataset] Consolidate similar tests for file formats

     [ https://issues.apache.org/jira/browse/ARROW-12500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated ARROW-12500:
-----------------------------------
    Labels: dataset datasets pull-request-available  (was: dataset datasets)

> [C++][Dataset] Consolidate similar tests for file formats
> ---------------------------------------------------------
>
>                 Key: ARROW-12500
>                 URL: https://issues.apache.org/jira/browse/ARROW-12500
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: David Li
>            Assignee: David Li
>            Priority: Major
>              Labels: dataset, datasets, pull-request-available
>             Fix For: 5.0.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Across the CSV, Parquet, and IPC formats we have a number of very similar, and in some cases essentially identical, tests. As we do more refactoring and development, it would be nice to consolidate these tests so that all formats behave consistently and get the same level of testing. For instance, ARROW-11772 now adds more comprehensive tests for scanning IPC which don't yet apply to Parquet or CSV.
> This sort of consolidation may also be nice to do in Python.
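
One possible shape for the consolidation described above, as a minimal standalone sketch: the shared checks are written once as templates and instantiated per format. Everything here is hypothetical and for illustration only (CsvLikeFormat, LengthPrefixedFormat, Rows, CheckRoundTrip, CheckEmptyInput, RunSharedChecks); none of it is Arrow's actual dataset or test API, and the two toy formats merely stand in for CSV/Parquet/IPC.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Toy in-memory "table": a list of integer rows (hypothetical, not an Arrow type).
using Rows = std::vector<std::vector<int>>;

// Hypothetical stand-in for one file format: comma-separated text, one row per line.
struct CsvLikeFormat {
  static std::string Write(const Rows& rows) {
    std::ostringstream out;
    for (const auto& row : rows) {
      for (std::size_t i = 0; i < row.size(); ++i) {
        if (i > 0) out << ',';
        out << row[i];
      }
      out << '\n';
    }
    return out.str();
  }
  static Rows Read(const std::string& data) {
    Rows rows;
    std::istringstream in(data);
    std::string line;
    while (std::getline(in, line)) {
      std::vector<int> row;
      std::istringstream fields(line);
      std::string field;
      while (std::getline(fields, field, ',')) row.push_back(std::stoi(field));
      rows.push_back(row);
    }
    return rows;
  }
};

// Hypothetical stand-in for a second format: each row is "<count> v1 v2 ...".
struct LengthPrefixedFormat {
  static std::string Write(const Rows& rows) {
    std::ostringstream out;
    for (const auto& row : rows) {
      out << row.size();
      for (int value : row) out << ' ' << value;
      out << '\n';
    }
    return out.str();
  }
  static Rows Read(const std::string& data) {
    Rows rows;
    std::istringstream in(data);
    std::size_t count = 0;
    while (in >> count) {
      std::vector<int> row(count);
      for (auto& value : row) in >> value;
      rows.push_back(row);
    }
    return rows;
  }
};

// Shared checks, written once and instantiated for every format.
template <typename Format>
bool CheckRoundTrip(const Rows& rows) {
  return Format::Read(Format::Write(rows)) == rows;
}

template <typename Format>
bool CheckEmptyInput() {
  return Format::Read(Format::Write({})).empty();
}

// Adding a check here automatically applies it to all formats.
template <typename Format>
bool RunSharedChecks() {
  const Rows sample = {{1, 2, 3}, {-4, 5, 6}};
  return CheckRoundTrip<Format>(sample) && CheckEmptyInput<Format>();
}
```

Calling RunSharedChecks<CsvLikeFormat>() and RunSharedChecks<LengthPrefixedFormat>() runs the identical checks against both stand-ins. In a real test suite the same idea is commonly expressed with a test framework's typed or parameterized fixtures rather than hand-rolled templates.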



--
This message was sent by Atlassian Jira
(v8.3.4#803005)