You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Dmitriy V. Ryaboy (Commented) (JIRA)" <ji...@apache.org> on 2011/11/16 06:20:51 UTC

[jira] [Commented] (PIG-2376) Local mode for GroupBy followed by aggrete function results in ERROR: 2118

    [ https://issues.apache.org/jira/browse/PIG-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13151028#comment-13151028 ] 

Dmitriy V. Ryaboy commented on PIG-2376:
----------------------------------------

The error here says that the input file does not exist. Can you verify that the path /user/user1/pig/tests/data/singlefile/studenttab10k exists on your test machine? I just ran the same script on the tip of branch 0.9, and it works fine.
                
> Local mode for GroupBy followed by aggrete function results in ERROR: 2118
> --------------------------------------------------------------------------
>
>                 Key: PIG-2376
>                 URL: https://issues.apache.org/jira/browse/PIG-2376
>             Project: Pig
>          Issue Type: Bug
>         Environment: Apache Pig version 0.9.2.1111101150 (r1200499)
> compiled Nov 10 2011, 19:50:15
> -bash-3.1$ hadoop version
> Hadoop 0.23.0.1111080202
> Subversion http://svn.apache.org/repos/asf/hadoop/common/branches/branch-0.23.0/hadoop-common-project/hadoop-common -r 1196973
> Compiled by hadoopqa on Tue Nov  8 02:12:04 PST 2011
> From source with checksum 4e42b2d96c899a98a8ab8c7cc23f27ae
>            Reporter: Araceli Henley
>
> Ran in LocalMode
> The following test 
> a = load '/user/user1/pig/tests/data/singlefile/studenttab10k' as (name, age, gpa);
> b = group a by (name, age);
> c = foreach b generate flatten(group), SUM(a.gpa);
> store c into '/user/user1/pig/out/user1.1321317375/GroupAggFunc_12.out';
> :
> ==============
> Failed Jobs:
> :==============
> JobId   Alias   Feature Message Outputs
> job_local_0001  a,b,c   GROUP_BY,COMBINER       Message: org.apache.pig.backend.executionengine.ExecException: ERROR 2118: Input path does not exist: file:/user/user1/pig/tests/data/singlefile/studenttab10k
>         at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:282)
>         at org.apache.hadoop.mapreduce.JobSubmitter.writeNewSplits(JobSubmitter.java:445)
>         at org.apache.hadoop.mapreduce.JobSubmitter.writeSplits(JobSubmitter.java:462)
>         at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:360)
>         at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1159)
>         at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1156)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:396)
>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1152)
>         at org.apache.hadoop.mapreduce.Job.submit(Job.java:1156)
>         at org.apache.hadoop.mapreduce.lib.jobcontrol.ControlledJob.submit(ControlledJob.java:336)
>         at org.apache.hadoop.mapreduce.lib.jobcontrol.JobControl.run(JobControl.java:233)
>         at java.lang.Thread.run(Thread.java:619)
> Caused by: org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: file:/user/user1/pig/tests/data/singlefile/studenttab10k
>         at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.listStatus(FileInputFormat.java:243)
>         at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigTextInputFormat.listStatus(PigTextInputFormat.java:36)
>         at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits(FileInputFormat.java:269)
>         at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:270)
>         ... 12 more
>         /user/user1/pig/out/user1.1321317375/GroupAggFunc_12.out,
> Input(s):
> Failed to read data from "/user/user1/pig/tests/data/singlefile/studenttab10k"
> Output(s):
> Failed to produce result in "/user/user1/pig/out/user1.1321317375/GroupAggFunc_12.out"
> File does exist from hdfs:
> -rw-r--r--   3 user1 hdfs     219190 2011-11-15 22:49 /user/user1/pig/tests/data/singlefile/studenttab10k
> Failed tests:
> GroupAggFunc_12 and GroupAggFunc_13

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira