Posted to commits@hudi.apache.org by "Vinoth Chandar (Jira)" <ji...@apache.org> on 2021/01/21 05:41:00 UTC

[jira] [Updated] (HUDI-1303) Some improvements for the HUDI Test Suite

     [ https://issues.apache.org/jira/browse/HUDI-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinoth Chandar updated HUDI-1303:
---------------------------------
    Fix Version/s: 0.7.0

> Some improvements for the HUDI Test Suite
> -----------------------------------------
>
>                 Key: HUDI-1303
>                 URL: https://issues.apache.org/jira/browse/HUDI-1303
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: Prashant Wason
>            Assignee: Prashant Wason
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 0.7.0
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
>  # Use the DAG node's label from the YAML as its name instead of a UUID; UUID names are not descriptive when debugging issues from logs.
>  # Fix the CleanNode constructor, which is not correctly implemented.
>  # When generating upserts, allow more granular control over the number of inserts and upserts: zero or more of each can be specified, instead of always requiring both inserts and upserts.
>  # Fix generation of records of a specific size.
>  ** The current code used a class variable "shouldAddMore", which was reset to false after the first record was generated, causing subsequent records to be of minimum size.
>  ** In this change, we pre-calculate the extra size of the complex fields and store it in a map. When generating records, we read the size of each complex field from this map.
>  # Refresh the timeline of the DeltaSync service before calling readFromSource. This ensures that only the newest generated data is read and that data generated by older DAG nodes is ignored (as their Avro files will have an older timestamp).
>  # Make --workload-generator-classname an optional parameter, since the default will most likely be used.
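
The record-size fix described in the list above can be illustrated with a minimal sketch. This is not the Hudi test suite's code: the function names, the MIN_RECORD_SIZE constant, and the even split of padding across fields are all assumptions made for illustration. It only shows the general idea of pre-calculating the extra size per complex field once and reading it from a map for every record, rather than relying on a flag that is reset after the first record.

```python
# Hypothetical baseline size (in bytes) of a record with empty complex fields.
MIN_RECORD_SIZE = 16


def extra_size_per_field(complex_fields, target_size, min_size=MIN_RECORD_SIZE):
    """Pre-calculate how many padding bytes each complex field should carry."""
    if not complex_fields:
        return {}
    extra = max(target_size - min_size, 0)
    share, remainder = divmod(extra, len(complex_fields))
    # Hand the remainder to the first fields so the shares sum to `extra`.
    return {
        name: share + (1 if i < remainder else 0)
        for i, name in enumerate(complex_fields)
    }


def generate_record(complex_fields, extra_sizes):
    """Build one record; sizes come from the shared map, not from mutable state."""
    return {name: "x" * extra_sizes[name] for name in complex_fields}


def record_size(record, min_size=MIN_RECORD_SIZE):
    """Size of a record under this sketch's simplified size model."""
    return min_size + sum(len(value) for value in record.values())


fields = ["address", "tags", "history"]  # hypothetical complex fields
sizes = extra_size_per_field(fields, target_size=116)
records = [generate_record(fields, sizes) for _ in range(3)]
```

Because the map is computed once and never mutated during generation, the second and third records come out the same size as the first, which is the behavior the fix is meant to restore.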



--
This message was sent by Atlassian Jira
(v8.3.4#803005)