You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Remi Dettai (Jira)" <ji...@apache.org> on 2020/09/29 13:52:00 UTC

[jira] [Created] (ARROW-10135) [Rust] [Parquet] Refactor file module to help adding sources

Remi Dettai created ARROW-10135:
-----------------------------------

             Summary: [Rust] [Parquet] Refactor file module to help adding sources
                 Key: ARROW-10135
                 URL: https://issues.apache.org/jira/browse/ARROW-10135
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Rust
    Affects Versions: 1.0.1
            Reporter: Remi Dettai


Currently, the Parquet reader is very strongly tied to file system reads. This makes it hard to add other sources, such as S3...

To improve modularity, we could try to move as much logic as possible to the generic traits (FileReader, RowGroupReader...) and reduce the content of the implementing structs (SerializedFileReader, SerializedRowGroupReader...) to the part that is specific to file/buffered reads.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)