You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neville Dipale (Jira)" <ji...@apache.org> on 2021/03/30 16:35:00 UTC

[jira] [Created] (ARROW-12153) [Rust] [Parquet] Return file metadata after writing Parquet file

Neville Dipale created ARROW-12153:
--------------------------------------

             Summary: [Rust] [Parquet] Return file metadata after writing Parquet file
                 Key: ARROW-12153
                 URL: https://issues.apache.org/jira/browse/ARROW-12153
             Project: Apache Arrow
          Issue Type: New Feature
            Reporter: Neville Dipale
            Assignee: Neville Dipale


Parquet writers like delta-rs rely on the Parquet metadata to write file-level statistics for file pruning purposes.

We currently do not expose these stats, requiring the writer to read the file that has just been written, to get the stats. This is more problematic for in-memory sinks, as there is currently no way of getting the metadata from the sink before it's persisted.

Explore if we can expose these stats to the writer, to make the above easier.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)