You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Leping Huang (Jira)" <ji...@apache.org> on 2020/08/20 19:57:00 UTC
[jira] [Assigned] (HUDI-113) Get rid of using "#" as the separator
in BloomIndex shuffling
[ https://issues.apache.org/jira/browse/HUDI-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Leping Huang reassigned HUDI-113:
---------------------------------
Assignee: Leping Huang
> Get rid of using "#" as the separator in BloomIndex shuffling
> -------------------------------------------------------------
>
> Key: HUDI-113
> URL: https://issues.apache.org/jira/browse/HUDI-113
> Project: Apache Hudi
> Issue Type: Bug
> Components: Index, newbie
> Reporter: Vinoth Chandar
> Assignee: Leping Huang
> Priority: Major
> Labels: pull-request-available
> Time Spent: 20m
> Remaining Estimate: 0h
>
> In explodeRecordRDDWithFileComparisons, we should use a Tuple2 instead of "#"
> {code:java}
> List<Tuple2<String, Tuple2<String, HoodieKey>>> recordComparisons = new ArrayList<>();
> indexFileFilter.getMatchingFiles(partitionPath, recordKey).forEach(matchingFile -> {
> recordComparisons.add(
> new Tuple2<>(String.format("%s#%s", matchingFile, recordKey),
> new Tuple2<>(matchingFile,
> new HoodieKey(recordKey, partitionPath))));{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)