You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Leping Huang (Jira)" <ji...@apache.org> on 2020/08/20 19:57:00 UTC

[jira] [Assigned] (HUDI-113) Get rid of using "#" as the separator in BloomIndex shuffling

     [ https://issues.apache.org/jira/browse/HUDI-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Leping Huang reassigned HUDI-113:
---------------------------------

    Assignee: Leping Huang

> Get rid of using "#" as the separator in BloomIndex shuffling
> -------------------------------------------------------------
>
>                 Key: HUDI-113
>                 URL: https://issues.apache.org/jira/browse/HUDI-113
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: Index, newbie
>            Reporter: Vinoth Chandar
>            Assignee: Leping Huang
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> In explodeRecordRDDWithFileComparisons, we should use a Tuple2 instead of "#" 
> {code:java}
> List<Tuple2<String, Tuple2<String, HoodieKey>>> recordComparisons = new ArrayList<>();
> indexFileFilter.getMatchingFiles(partitionPath, recordKey).forEach(matchingFile -> {
>   recordComparisons.add(
>       new Tuple2<>(String.format("%s#%s", matchingFile, recordKey),
>           new Tuple2<>(matchingFile,
>               new HoodieKey(recordKey, partitionPath))));{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)