You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Felix Schmalzel (Jira)" <ji...@apache.org> on 2021/02/16 15:16:00 UTC

[jira] [Created] (PARQUET-1982) Allow random access to row groups in ParquetFileReader

Felix Schmalzel created PARQUET-1982:
----------------------------------------

             Summary: Allow random access to row groups in ParquetFileReader
                 Key: PARQUET-1982
                 URL: https://issues.apache.org/jira/browse/PARQUET-1982
             Project: Parquet
          Issue Type: New Feature
          Components: parquet-mr
            Reporter: Felix Schmalzel


The used SeekableInputStream and all other components of the ParquetFileReader already support random access and row groups should be independent of each other.

This would allow reusing the opened InputStream when you want to go back a row group. It also makes accessing specific row groups a lot easier.

I've already developed a patch that would enable this functionality. I will link the merge request in the next few days.

Is there a related ticket that i have overlooked?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)