Posted to commits@hudi.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/08/19 15:46:00 UTC

[jira] [Updated] (HUDI-2322) Only include meta fields to reorder while preparing dataset for bulk insert

     [ https://issues.apache.org/jira/browse/HUDI-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated HUDI-2322:
---------------------------------
    Labels: pull-request-available  (was: )

> Only include meta fields to reorder while preparing dataset for bulk insert
> ---------------------------------------------------------------------------
>
>                 Key: HUDI-2322
>                 URL: https://issues.apache.org/jira/browse/HUDI-2322
>             Project: Apache Hudi
>          Issue Type: Bug
>            Reporter: Sagar Sumit
>            Assignee: Sagar Sumit
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.9.0
>
>
> The filter below in `HoodieDatasetBulkInsertHelper` matches every field whose name contains `_hoodie_`, so `_hoodie_is_deleted` is reordered along with the meta columns even though it is not one of them.
> {code:java}
> List<Column> originalFields = Arrays.stream(rowsWithMetaCols.schema().fields())
>     .filter(field -> !field.name().contains("_hoodie_"))
>     .map(f -> new Column(f.name()))
>     .collect(Collectors.toList());
> List<Column> metaFields = Arrays.stream(rowsWithMetaCols.schema().fields())
>     .filter(field -> field.name().contains("_hoodie_"))
>     .map(f -> new Column(f.name()))
>     .collect(Collectors.toList());
> {code}
> The fix is to treat a field as a meta field only if its name is in `HoodieRecord.HOODIE_META_COLUMNS_WITH_OPERATION`, rather than matching on the `_hoodie_` substring.
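
A minimal sketch of the membership-based split described in the issue, written against plain field names so that it runs without Spark or Hudi on the classpath. The class `MetaFieldSplit`, its methods, and the `META_COLUMNS` set are hypothetical stand-ins: the set is assumed to mirror the contents of `HoodieRecord.HOODIE_META_COLUMNS_WITH_OPERATION`, and the actual patch may differ in detail.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

public class MetaFieldSplit {

    // Hypothetical stand-in for HoodieRecord.HOODIE_META_COLUMNS_WITH_OPERATION.
    // The exact contents are an assumption made for this illustration.
    public static final Set<String> META_COLUMNS = new LinkedHashSet<>(Arrays.asList(
        "_hoodie_commit_time",
        "_hoodie_commit_seqno",
        "_hoodie_record_key",
        "_hoodie_partition_path",
        "_hoodie_file_name",
        "_hoodie_operation"));

    // Keep only fields that are explicitly listed as meta columns.
    public static List<String> metaFields(List<String> schemaFields) {
        return schemaFields.stream()
            .filter(META_COLUMNS::contains)
            .collect(Collectors.toList());
    }

    // Everything else, including _hoodie_is_deleted, keeps its original position.
    public static List<String> originalFields(List<String> schemaFields) {
        return schemaFields.stream()
            .filter(name -> !META_COLUMNS.contains(name))
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> fields = Arrays.asList(
            "_hoodie_commit_time", "_hoodie_record_key", "id", "_hoodie_is_deleted", "ts");
        System.out.println(metaFields(fields));     // [_hoodie_commit_time, _hoodie_record_key]
        System.out.println(originalFields(fields)); // [id, _hoodie_is_deleted, ts]
    }
}
```

With an exact-name check, `_hoodie_is_deleted` lands in the original-field list, whereas the substring filter quoted above would have grouped it with the meta fields.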



--
This message was sent by Atlassian Jira
(v8.3.4#803005)