You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Sagar Sumit (Jira)" <ji...@apache.org> on 2021/08/19 15:44:00 UTC

[jira] [Created] (HUDI-2322) Only include meta fields to reorder while preparing dataset for bulk insert

Sagar Sumit created HUDI-2322:
---------------------------------

             Summary: Only include meta fields to reorder while preparing dataset for bulk insert
                 Key: HUDI-2322
                 URL: https://issues.apache.org/jira/browse/HUDI-2322
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Sagar Sumit
            Assignee: Sagar Sumit
             Fix For: 0.9.0


Below filter in `HoodieDatasetBulkInsertHelper` will result in `_hoodie_is_deleted` to be reordered as well even though it is not part of meta columns. 

{code:java}
List<Column> originalFields =
        Arrays.stream(rowsWithMetaCols.schema().fields()).filter(field -> !field.name().contains("_hoodie_")).map(f -> new Column(f.name())).collect(Collectors.toList());

    List<Column> metaFields =
        Arrays.stream(rowsWithMetaCols.schema().fields()).filter(field -> field.name().contains("_hoodie_")).map(f -> new Column(f.name())).collect(Collectors.toList());
{code}

The fix is to check only for `HoodieRecord.HOODIE_META_COLUMNS_WITH_OPERATION`.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)