Posted to issues@arrow.apache.org by "Francois Saint-Jacques (Jira)" <ji...@apache.org> on 2019/08/21 16:33:01 UTC

[jira] [Updated] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

     [ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Francois Saint-Jacques updated ARROW-3764:
------------------------------------------
    Labels: dataset datasets parquet  (was: datasets parquet)

> [C++] Port Python "ParquetDataset" business logic to C++
> --------------------------------------------------------
>
>                 Key: ARROW-3764
>                 URL: https://issues.apache.org/jira/browse/ARROW-3764
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>              Labels: dataset, datasets, parquet
>             Fix For: 1.0.0
>
>
> Along with defining appropriate abstractions for dealing with generic filesystems in C++, we should implement the machinery for reading multiple Parquet files in C++ so that it can be reused in GLib, R, and Ruby. Otherwise these languages will have to reimplement this logic, which would surely result in inconsistent features and in bugs that appear in some implementations but not others.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)