You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Remi Dettai (Jira)" <ji...@apache.org> on 2020/10/26 08:06:00 UTC
[jira] [Created] (ARROW-10387) [Rust] Avoid call for file size
metadata when reading parquet footer
Remi Dettai created ARROW-10387:
-----------------------------------
Summary: [Rust] Avoid call for file size metadata when reading parquet footer
Key: ARROW-10387
URL: https://issues.apache.org/jira/browse/ARROW-10387
Project: Apache Arrow
Issue Type: Improvement
Reporter: Remi Dettai
This is an extension to [ARROW-10368|https://issues.apache.org/jira/browse/ARROW-10368].
When reading a Parquet file, you first need to read its footer and metadata. If you only have "read from start" capability, this means you need the size of the file to read relatively to end. On some storages, getting the size metadata can be expensive (e.g extra http call for blob storage).
The proposition is to add the capability to "read from end" to the ChunkReader trait as most file storages will have this feature (file storage as well as blob storages).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)