Posted to dev@parquet.apache.org by "Ryan Blue (JIRA)" <ji...@apache.org> on 2018/09/27 22:33:00 UTC

[jira] [Commented] (PARQUET-1201) Column indexes

    [ https://issues.apache.org/jira/browse/PARQUET-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16631130#comment-16631130 ] 

Ryan Blue commented on PARQUET-1201:
------------------------------------

[~gszadovszky], where is the branch for page skipping? Is it this one? https://github.com/apache/parquet-mr/tree/column-indexes

I just went to review it, but I don't see a PR. Could you open one against master?

> Column indexes
> --------------
>
>                 Key: PARQUET-1201
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1201
>             Project: Parquet
>          Issue Type: New Feature
>    Affects Versions: 1.10.0
>            Reporter: Gabor Szadovszky
>            Assignee: Gabor Szadovszky
>            Priority: Major
>             Fix For: format-2.5.0
>
>
> Write the column indexes described in PARQUET-922.
>  This is the first phase of implementing the whole feature. The implementation is done in the following steps:
>  * Utility to read/write indexes in parquet-format
>  * Writing indexes in the parquet file
>  * Extend parquet-tools and parquet-cli to show the indexes
>  * Limit index size based on parquet properties
>  * Trim min/max values where possible based on parquet properties
>  * Filtering based on column indexes
> The work is done on the feature branch {{column-indexes}}. This JIRA will be resolved after the branch has been merged to {{master}}.


