Posted to dev@parquet.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2018/09/21 15:33:00 UTC

[jira] [Created] (PARQUET-1422) [C++] Use Arrow IO interfaces natively rather than current parquet:: wrappers

Wes McKinney created PARQUET-1422:
-------------------------------------

             Summary: [C++] Use Arrow IO interfaces natively rather than current parquet:: wrappers
                 Key: PARQUET-1422
                 URL: https://issues.apache.org/jira/browse/PARQUET-1422
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-cpp
            Reporter: Wes McKinney
             Fix For: cpp-1.6.0


We are beginning to do some work on asynchronous IO in Arrow, and it would be great to leverage this in the Parquet core internals.

I am proposing to remove the Parquet-specific virtual file interfaces in

https://github.com/apache/arrow/blob/master/cpp/src/parquet/util/memory.h#L221

and instead rely directly on the Arrow ones in arrow::io. In addition to reducing the amount of code we have to maintain, this will also let us improve the performance of Parquet by using common utilities for managing asynchronous / background IO.
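
To make the shape of the change concrete, here is a rough, self-contained sketch. Everything in the sketch namespace is a hypothetical stand-in written for illustration: RandomAccessFile and BufferReader only loosely mimic the style of the arrow::io interfaces, WrappedSource stands in for a Parquet-specific wrapper, and ReadTail is a made-up helper. None of these are the actual signatures in arrow::io or parquet/util/memory.h.

```cpp
// Illustrative sketch only: hypothetical stand-in types, not the real
// arrow::io or parquet:: class definitions.
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <cstring>
#include <memory>
#include <string>
#include <utility>

namespace sketch {

// Stand-in for a shared, Arrow-style random-access file interface.
class RandomAccessFile {
 public:
  virtual ~RandomAccessFile() = default;
  virtual int64_t GetSize() const = 0;
  // Reads up to nbytes starting at position; returns the bytes actually read.
  virtual int64_t ReadAt(int64_t position, int64_t nbytes, void* out) = 0;
};

// Simple in-memory implementation of the shared interface.
class BufferReader : public RandomAccessFile {
 public:
  explicit BufferReader(std::string data) : data_(std::move(data)) {}

  int64_t GetSize() const override {
    return static_cast<int64_t>(data_.size());
  }

  int64_t ReadAt(int64_t position, int64_t nbytes, void* out) override {
    if (position < 0 || nbytes <= 0 || position >= GetSize()) return 0;
    const int64_t n = std::min(nbytes, GetSize() - position);
    std::memcpy(out, data_.data() + position, static_cast<size_t>(n));
    return n;
  }

 private:
  std::string data_;
};

// "Before": a format-specific wrapper that only forwards to the shared
// interface. This is the kind of layer the proposal would delete.
class WrappedSource {
 public:
  explicit WrappedSource(std::shared_ptr<RandomAccessFile> file)
      : file_(std::move(file)) {
    assert(file_ != nullptr);
  }

  int64_t Size() const { return file_->GetSize(); }

  int64_t ReadAt(int64_t position, int64_t nbytes, uint8_t* out) {
    return file_->ReadAt(position, nbytes, out);
  }

 private:
  std::shared_ptr<RandomAccessFile> file_;
};

// "After": reader code accepts the shared interface directly, so any
// implementation of it (including a future async-aware one) can be passed in.
std::string ReadTail(RandomAccessFile& file, int64_t nbytes) {
  const int64_t size = file.GetSize();
  const int64_t n = std::max<int64_t>(0, std::min(nbytes, size));
  std::string out(static_cast<size_t>(n), '\0');
  const int64_t got = file.ReadAt(size - n, n, &out[0]);
  out.resize(static_cast<size_t>(got));
  return out;
}

}  // namespace sketch
```

With the wrapper gone, a call such as sketch::ReadTail(reader, 4) works against any implementation of the shared interface, which is the property that would let Parquet pick up asynchronous / background IO utilities without a second adapter layer.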

cc [~mdeepak] [~xhochy]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)