You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2020/06/15 17:48:00 UTC

[jira] [Updated] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

     [ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neal Richardson updated ARROW-3764:
-----------------------------------
    Fix Version/s:     (was: 1.0.0)
                   2.0.0

> [C++] Port Python "ParquetDataset" business logic to C++
> --------------------------------------------------------
>
>                 Key: ARROW-3764
>                 URL: https://issues.apache.org/jira/browse/ARROW-3764
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>              Labels: dataset, dataset-parquet-read, parquet
>             Fix For: 2.0.0
>
>
> Along with defining appropriate abstractions for dealing with generic filesystems in C++, we should implement the machinery for reading multiple Parquet files in C++ so that it can reused in GLib, R, and Ruby. Otherwise these languages will have to reimplement things, and this would surely result in inconsistent features, bugs in some implementations but not others



--
This message was sent by Atlassian Jira
(v8.3.4#803005)