You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@hudi.apache.org by "Vinoth Chandar (Jira)" <ji...@apache.org> on 2019/11/10 22:29:00 UTC

[jira] [Updated] (HUDI-90) Explore ways of indexing record keys in parquet addition to bloom filters

     [ https://issues.apache.org/jira/browse/HUDI-90?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinoth Chandar updated HUDI-90:
-------------------------------
    Summary: Explore ways of indexing record keys in parquet addition to bloom filters  (was: Explore ways of indexing record keys in addition to bloom filters)

> Explore ways of indexing record keys in parquet addition to bloom filters
> -------------------------------------------------------------------------
>
>                 Key: HUDI-90
>                 URL: https://issues.apache.org/jira/browse/HUDI-90
>             Project: Apache Hudi (incubating)
>          Issue Type: New Feature
>          Components: Index, Performance
>            Reporter: Vinoth Chandar
>            Assignee: Vinoth Chandar
>            Priority: Major
>              Labels: realtime-data-lakes
>
> https://issues.apache.org/jira/browse/PARQUET-1201 adds column indexes directly .. avaialable from parquet 1.10
>  
> [~vbalaji] . [~nishith29] thought you might be interested.. This can speed up indexes in case bloom filter/range info produces false positives



--
This message was sent by Atlassian Jira
(v8.3.4#803005)