You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Fabian Hueske (JIRA)" <ji...@apache.org> on 2016/03/24 12:43:25 UTC

[jira] [Created] (FLINK-3665) Range partitioning lacks support to define sort orders

Fabian Hueske created FLINK-3665:
------------------------------------

             Summary: Range partitioning lacks support to define sort orders
                 Key: FLINK-3665
                 URL: https://issues.apache.org/jira/browse/FLINK-3665
             Project: Flink
          Issue Type: Improvement
          Components: DataSet API
    Affects Versions: 1.0.0
            Reporter: Fabian Hueske
             Fix For: 1.1.0


{{DataSet.partitionByRange()}} does not allow to specify the sort order of fields. This is fine if range partitioning is used to reduce skewed partitioning. 
However, it is not sufficient if range partitioning is used to sort a data set in parallel. 

Since {{DataSet.partitionByRange()}} is {{@Public}} API and cannot be easily changed, I propose to add a method {{withOrders(Order... orders)}} to {{PartitionOperator}}. The method should throw an exception if the partitioning method of {{PartitionOperator}} is not range partitioning.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)