Posted to dev@pig.apache.org by "Aniket Mokashi (JIRA)" <ji...@apache.org> on 2014/03/21 03:12:42 UTC

[jira] [Updated] (PIG-3825) Stats collection needs to be changed for hadoop2 (with auto local mode)

     [ https://issues.apache.org/jira/browse/PIG-3825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aniket Mokashi updated PIG-3825:
--------------------------------

    Attachment: PIG-3825.patch

> Stats collection needs to be changed for hadoop2 (with auto local mode)
> -----------------------------------------------------------------------
>
>                 Key: PIG-3825
>                 URL: https://issues.apache.org/jira/browse/PIG-3825
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.13.0
>            Reporter: Aniket Mokashi
>            Assignee: Aniket Mokashi
>             Fix For: 0.13.0
>
>         Attachments: PIG-3825.patch
>
>
> With auto local mode, the Hadoop 2 getJob, getMapTaskReports, and getReduceTaskReports APIs throw a NumberFormatException with a stack trace similar to the following.
> (I have also noticed that getMapTaskReports causes a memory leak and OOMs for long-running jobs, since Hadoop 2 map task reports are quite bulky, but that is a separate issue.)
> {noformat}
> Caused by: java.lang.NumberFormatException: For input string: "local154006779"
>         at java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
>         at java.lang.Long.parseLong(Long.java:410)
>         at java.lang.Long.parseLong(Long.java:468)
>         at org.apache.hadoop.mapreduce.JobID.toClusterTimeStamp(JobID.java:172)
>         at org.apache.hadoop.mapreduce.JobID.getAppId(JobID.java:167)
>         at org.apache.hadoop.mapreduce.TypeConverter.toYarn(TypeConverter.java:79)
>         at org.apache.hadoop.mapred.ClientServiceDelegate.<init>(ClientServiceDelegate.java:114)
>         at org.apache.hadoop.mapred.ClientCache.getClient(ClientCache.java:68)
>         at org.apache.hadoop.mapred.YARNRunner.getJobStatus(YARNRunner.java:550)
>         at org.apache.hadoop.mapreduce.Cluster.getJob(Cluster.java:182)
>         at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:586)
>         at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:584)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:396)
>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1478)
>         at org.apache.hadoop.mapred.JobClient.getJobUsingCluster(JobClient.java:584)
>         at org.apache.hadoop.mapred.JobClient.getTaskReports(JobClient.java:638)
>         at org.apache.hadoop.mapred.JobClient.getMapTaskReports(JobClient.java:632)
>         at org.apache.pig.tools.pigstats.mapreduce.MRJobStats.addMapReduceStatistics(MRJobStats.java:318)
>         at org.apache.pig.tools.pigstats.mapreduce.MRPigStatsUtil.addSuccessJobStats(MRPigStatsUtil.java:308)
>         at org.apache.pig.tools.pigstats.mapreduce.MRPigStatsUtil.accumulateStats(MRPigStatsUtil.java:240)
>         at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:363)
>         at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.launchPig(HExecutionEngine.java:374)
>         ... 16 more
> {noformat}
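
The trace above fails in Long.parseLong because the identifier portion of a local-mode job ID ("local154006779") is not numeric, while the YARN conversion path expects a cluster timestamp there. The following is a minimal sketch of that failure and of one possible guard, not the contents of PIG-3825.patch. The class LocalJobIdCheck, its method names, and the full job ID strings (including the "_0001" sequence suffix and the numeric timestamp) are hypothetical and chosen only for illustration; only the identifier "local154006779" comes from the trace.

```java
public class LocalJobIdCheck {

    // Hypothetical helper: returns the middle part of an ID shaped like
    // job_<identifier>_<sequence>, the layout Hadoop uses for job IDs.
    public static String identifierOf(String jobId) {
        String[] parts = jobId.split("_");
        if (parts.length != 3 || !parts[0].equals("job")) {
            throw new IllegalArgumentException("Unexpected job ID: " + jobId);
        }
        return parts[1];
    }

    // Returns true only when the identifier parses as a long, which is
    // what the Long.parseLong call in the stack trace requires.
    public static boolean hasNumericIdentifier(String jobId) {
        try {
            Long.parseLong(identifierOf(jobId));
            return true;
        } catch (NumberFormatException e) {
            // Same exception type as in the trace, caught here instead of propagating.
            return false;
        }
    }

    public static void main(String[] args) {
        // Illustrative IDs; the suffix and the timestamp are assumptions.
        System.out.println(hasNumericIdentifier("job_local154006779_0001")); // prints false
        System.out.println(hasNumericIdentifier("job_1395367962000_0001")); // prints true
    }
}
```

A stats collector could use a check of this kind to skip the task-report calls for jobs that ran in local mode, under the assumption that the identifier format is the only thing distinguishing them.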



--
This message was sent by Atlassian JIRA
(v6.2#6252)