You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Alexey Kudinkin (Jira)" <ji...@apache.org> on 2022/09/16 20:11:00 UTC

[jira] [Created] (HUDI-4862) Add an option leverage RDDCustomColumnsSortPartitioner in bulk-insert

Alexey Kudinkin created HUDI-4862:
-------------------------------------

             Summary: Add an option leverage RDDCustomColumnsSortPartitioner in bulk-insert
                 Key: HUDI-4862
                 URL: https://issues.apache.org/jira/browse/HUDI-4862
             Project: Apache Hudi
          Issue Type: Improvement
            Reporter: Alexey Kudinkin
            Assignee: Alexey Kudinkin


Currently, to sort incoming batch using DeltaStreamer you'll have to resort to using SqlQueryBasedTransformer. 

This however should not be necessary provided that RDDCustomColumnsSortPartitioner does already exist and could be leveraged exactly for that.

This come up during discussions w/ [~abmodi]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)