You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2017/01/23 03:22:26 UTC

[jira] [Updated] (PARQUET-836) [C++] Add column selection to parquet::arrow::FileReader

     [ https://issues.apache.org/jira/browse/PARQUET-836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated PARQUET-836:
---------------------------------
    Assignee: Wes McKinney

Included in patch for PARQUET-835: https://github.com/apache/parquet-cpp/pull/222

> [C++] Add column selection to parquet::arrow::FileReader
> --------------------------------------------------------
>
>                 Key: PARQUET-836
>                 URL: https://issues.apache.org/jira/browse/PARQUET-836
>             Project: Parquet
>          Issue Type: New Feature
>          Components: parquet-cpp
>            Reporter: Wes McKinney
>            Assignee: Wes McKinney
>
> With multithreaded reads coming in PARQUET-835, it would be better to push down the subsetting into the FileReader rather than leaving this work at the application level. One artifact of this is the Python interface in Arrow (which handles subsetting in Python/Cython)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)