You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Gabor Szadovszky (Jira)" <ji...@apache.org> on 2021/04/19 07:54:00 UTC

[jira] [Assigned] (PARQUET-1982) Allow random access to row groups in ParquetFileReader

     [ https://issues.apache.org/jira/browse/PARQUET-1982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gabor Szadovszky reassigned PARQUET-1982:
-----------------------------------------

    Assignee: Felix Schmalzel

> Allow random access to row groups in ParquetFileReader
> ------------------------------------------------------
>
>                 Key: PARQUET-1982
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1982
>             Project: Parquet
>          Issue Type: New Feature
>          Components: parquet-mr
>            Reporter: Felix Schmalzel
>            Assignee: Felix Schmalzel
>            Priority: Minor
>              Labels: parquetReader, random-access
>
> The used SeekableInputStream and all other components of the ParquetFileReader already support random access and row groups should be independent of each other.
> This would allow reusing the opened InputStream when you want to go back a row group. It also makes accessing specific row groups a lot easier.
> I've already developed a patch that would enable this functionality. I will link the merge request in the next few days.
> Is there a related ticket that i have overlooked?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)