You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Felix Schmalzel (Jira)" <ji...@apache.org> on 2021/02/16 15:16:00 UTC
[jira] [Created] (PARQUET-1982) Allow random access to row groups
in ParquetFileReader
Felix Schmalzel created PARQUET-1982:
----------------------------------------
Summary: Allow random access to row groups in ParquetFileReader
Key: PARQUET-1982
URL: https://issues.apache.org/jira/browse/PARQUET-1982
Project: Parquet
Issue Type: New Feature
Components: parquet-mr
Reporter: Felix Schmalzel
The used SeekableInputStream and all other components of the ParquetFileReader already support random access and row groups should be independent of each other.
This would allow reusing the opened InputStream when you want to go back a row group. It also makes accessing specific row groups a lot easier.
I've already developed a patch that would enable this functionality. I will link the merge request in the next few days.
Is there a related ticket that i have overlooked?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)