Posted to dev@pig.apache.org by "Brad Tofel (JIRA)" <ji...@apache.org> on 2011/07/13 10:10:00 UTC

[jira] [Commented] (PIG-282) Custom Partitioner

    [ https://issues.apache.org/jira/browse/PIG-282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13064418#comment-13064418 ] 

Brad Tofel commented on PIG-282:
--------------------------------

Am I reading this right: is there no way to specify a custom partitioner for use with "ORDER BY"?

If so, is there any other way to perform a total ordering within Pig?

I will be doing a STORE immediately after the ORDER; the relation will not be used again. Is there some other workaround to achieve this?

I would love to replace my current Hadoop Java code with Pig, but total ordering is a requirement.

> Custom Partitioner
> ------------------
>
>                 Key: PIG-282
>                 URL: https://issues.apache.org/jira/browse/PIG-282
>             Project: Pig
>          Issue Type: New Feature
>    Affects Versions: 0.7.0
>            Reporter: Amir Youssefi
>            Assignee: Aniket Mokashi
>            Priority: Minor
>             Fix For: 0.8.0
>
>         Attachments: CustomPartitioner.patch, CustomPartitionerFinale.patch, CustomPartitionerTest.patch
>
>
> Adding a custom partitioner would give users control over which output partition a key (or value) goes to. We could add keywords to the language, e.g.
> PARTITION BY UDF(...)
> or a similar syntax. The UDF returns a number between 0 and n-1, where n is the number of output partitions.
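
To make the "number between 0 and n-1" contract concrete, here is a minimal sketch in plain Java of a range-style partition function. It is not Pig's UDF interface or Hadoop's Partitioner API; the class name RangePartitionSketch, the method partitionFor, and the split points "g" and "p" are all hypothetical, chosen only for illustration. It assumes string keys and a sorted array of split points, so that n = splitPoints.length + 1.

```java
import java.util.Arrays;

public class RangePartitionSketch {

    // Hypothetical helper: maps a key to a partition index in [0, n-1],
    // where n = splitPoints.length + 1. splitPoints must be sorted ascending.
    // Partition i receives keys k with splitPoints[i-1] <= k < splitPoints[i].
    static int partitionFor(String key, String[] splitPoints) {
        int pos = Arrays.binarySearch(splitPoints, key);
        // When the key is absent, binarySearch returns -(insertionPoint) - 1.
        return pos >= 0 ? pos + 1 : -(pos + 1);
    }

    public static void main(String[] args) {
        String[] splitPoints = {"g", "p"}; // assumed split points: 3 partitions
        for (String key : new String[] {"apple", "grape", "zebra"}) {
            System.out.println(key + " -> partition " + partitionFor(key, splitPoints));
        }
    }
}
```

Because this function never assigns a smaller key to a higher partition than a larger key, sorting each partition locally and reading the partitions in index order would give a total ordering. A hash-based function would also satisfy the 0 to n-1 contract, but without that ordering property.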

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira