You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Pradeep Kamath (JIRA)" <ji...@apache.org> on 2009/10/09 03:44:31 UTC

[jira] Commented: (PIG-894) order-by fails when input is empty

    [ https://issues.apache.org/jira/browse/PIG-894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12763776#action_12763776 ] 

Pradeep Kamath commented on PIG-894:
------------------------------------

The patch uses pig.inputs property from jobconf which does not directly have the input file name - it actually has a serialized arrayList<Pair<FileSpec, Boolean>> in string form containing the filespec and the issplittable flag for each input for the job - this serialized string will need to be deserialized using ObjectSerializer.deserialize and then from the filespec, the filename will need to be retrieved.

> order-by fails when input is empty
> ----------------------------------
>
>                 Key: PIG-894
>                 URL: https://issues.apache.org/jira/browse/PIG-894
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Thejas M Nair
>            Assignee: Daniel Dai
>         Attachments: PIG-894-1.patch
>
>
> grunt> l = load 'students.txt' ;
> grunt> f = filter l by 1 == 2;
> grunt> o = order f by $0 ;
> grunt> dump o;
> This results in 3 MR jobs . The 2nd (sampling) MR creates empty sample file, and 3rd MR (order-by) fails with following error in Map job -
> java.lang.RuntimeException: java.lang.RuntimeException: Empty samples file
> 	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.partitioners.WeightedRangePartitioner.configure(WeightedRangePartitioner.java:104)
> 	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:58)
> 	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:82)
> 	at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.(MapTask.java:348)
> 	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:193)
> 	at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2207)
> Caused by: java.lang.RuntimeException: Empty samples file
> 	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.partitioners.WeightedRangePartitioner.configure(WeightedRangePartitioner.java:89)
> 	... 5 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.