You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@giraph.apache.org by "Alexandre Fonseca (JIRA)" <ji...@apache.org> on 2013/12/09 22:44:07 UTC

[jira] [Updated] (GIRAPH-814) Incorrect MapReduce application classpath processing

     [ https://issues.apache.org/jira/browse/GIRAPH-814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexandre Fonseca updated GIRAPH-814:
-------------------------------------

    Attachment: GIRAPH-814.patch

This patch should fix the issue.

Passes mvn verify and tested on the SimpleShortestPaths example on a local cluster.

> Incorrect MapReduce application classpath processing
> ----------------------------------------------------
>
>                 Key: GIRAPH-814
>                 URL: https://issues.apache.org/jira/browse/GIRAPH-814
>             Project: Giraph
>          Issue Type: Bug
>    Affects Versions: 1.1.0
>            Reporter: Alexandre Fonseca
>              Labels: yarn
>         Attachments: GIRAPH-814.patch
>
>
> *Symptom:*
> Yarn ApplicationManager is unable to find mapred classes if user does not override mapreduce.application.classpath in mapred-site.xml:
> {code}(GiraphApplicationMaster.java:main(442)) - GiraphApplicationMaster caught a t$
> java.lang.NoClassDefFoundError: org/apache/hadoop/mapreduce/lib/output/TextOutputFormat{code}
> *Culprit:*
> Processing of default mapreduce application classpath in addLocalClasspathToEnv(..)  YarnUtils.java:180
> *Reasoning:*
> YarnConfiguration.DEFAULT_YARN_APPLICATION_CLASSPATH is defined as an array of strings as per YarnConfiguration.java:832:
> {code}  public static final String[] DEFAULT_YARN_APPLICATION_CLASSPATH = {
>       ApplicationConstants.Environment.HADOOP_CONF_DIR.$(),
>       ApplicationConstants.Environment.HADOOP_COMMON_HOME.$()
>           + "/share/hadoop/common/*",
>       ApplicationConstants.Environment.HADOOP_COMMON_HOME.$()
>           + "/share/hadoop/common/lib/*",
>       ApplicationConstants.Environment.HADOOP_HDFS_HOME.$()
>           + "/share/hadoop/hdfs/*",
>       ApplicationConstants.Environment.HADOOP_HDFS_HOME.$()
>           + "/share/hadoop/hdfs/lib/*",
>       ApplicationConstants.Environment.HADOOP_YARN_HOME.$()
>           + "/share/hadoop/yarn/*",
>       ApplicationConstants.Environment.HADOOP_YARN_HOME.$()
>           + "/share/hadoop/yarn/lib/*" };{code}
> MRJobConfig.DEFAULT_MAPREDUCE_APPLICATION_CLASSPATH is defined as a comma separated string as per MRJobConfig.java:679:
> {code}  public final String 
>   DEFAULT_MAPREDUCE_APPLICATION_CLASSPATH = Shell.WINDOWS ?
>       "%HADOOP_MAPRED_HOME%\\share\\hadoop\\mapreduce\\*," 
>       + "%HADOOP_MAPRED_HOME%\\share\\hadoop\\mapreduce\\lib\\*" :
>       "$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*,"
>       + "$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*";{code}
> However, in YarnUtils.java:190, DEFAULT_MAPREDUCE_APPLICATION_CLASSPATH is treated as if it were an array of strings just as YARN_APPLICATION_CLASSPATH some lines before. This results in an incorrect classpath if the user relies on the default setting of MAPREDUCE_APPLICATION_CLASSPATH (notice the comma between the last 2 entries that should be a colon):
> {code}13/12/09 21:54:56 INFO yarn.GiraphYarnClient: Environment for AM :
> {CLASSPATH=${CLASSPATH}:./*:$HADOOP_CONF_DIR:
> $HADOOP_COMMON_HOME/share/hadoop/common/*:
> $HADOOP_COMMON_HOME/share/hadoop/common/lib/*:
> $HADOOP_HDFS_HOME/share/hadoop/hdfs/*:
> $HADOOP_HDFS_HOME/share/hadoop/hdfs/lib/*:
> $HADOOP_YARN_HOME/share/hadoop/yarn/*:
> $HADOOP_YARN_HOME/share/hadoop/yarn/lib/*:
> $HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*,
> $HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*}{code}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)