You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@slider.apache.org by "Billie Rinaldi (JIRA)" <ji...@apache.org> on 2014/08/21 18:13:11 UTC

[jira] [Updated] (SLIDER-343) Command files have incorrect hostname

     [ https://issues.apache.org/jira/browse/SLIDER-343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Billie Rinaldi updated SLIDER-343:
----------------------------------

    Summary: Command files have incorrect hostname  (was: App processes not killed)

> Command files have incorrect hostname
> -------------------------------------
>
>                 Key: SLIDER-343
>                 URL: https://issues.apache.org/jira/browse/SLIDER-343
>             Project: Slider
>          Issue Type: Bug
>            Reporter: Billie Rinaldi
>
> I'm trying to fix an issue with how the Accumulo app package starts its processes (currently they don't know their hostname).  After making one change that isn't quite working yet, the app instance has ended up in a state where the app has finished and is destroyed, but there are still app processes running on the cluster.
> Output of slider list:
> version: 1.6.1-snapshot
> name: accumulo
> state: FINISHED
> URL: http://node-1.example.com:8088/proxy/application_1408460425050_0028/A
> Started 21 Aug 2014 14:59:45 GMT
> Finished 21 Aug 2014 15:02:55 GMT
> RPC :node-1.example.com:38102
> Diagnostics :org.apache.slider.core.exceptions.TriggerClusterTeardownException: Unstable Application Instance : - failed with role ACCUMULO_TSERVER failing 6 times (2 in startup); threshold is 5 - last failure: Failure container_1408460425050_0028_01_000029 on host node-3.example.com, see http://node-1.example.com:19888/jobhistory/logs/node-3.example.com:45454/container_1408460425050_0028_01_000029/ctx/billie
> Output of ps -ef | grep accumulo | grep master (only one master was requested):
> yarn     11888     1  0 08:00 ?        00:00:11 /usr/lib/jvm/java/bin/java -Dapp=master -XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=75 -Djava.net.preferIPv4Stack=true -Xmx128m -Xms128m -classpath /yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000002/app/install/accumulo-1.6.1-SNAPSHOT/conf:/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000002/app/install/accumulo-1.6.1-SNAPSHOT/lib/accumulo-start.jar:/usr/lib/hadoop/lib/log4j-1.2.17.jar -XX:OnOutOfMemoryError=kill -9 %p -XX:-OmitStackTraceInFastThrow -Djavax.xml.parsers.DocumentBuilderFactory=com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderFactoryImpl -Dorg.apache.accumulo.core.home.dir=/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000002/app/install/accumulo-1.6.1-SNAPSHOT -Dhadoop.home.dir=/usr/lib/hadoop -Dzookeeper.home.dir=/usr/lib/zookeeper org.apache.accumulo.start.Main master --address node-1.example.com
> yarn     12278     1  0 08:00 ?        00:00:07 /usr/lib/jvm/java/bin/java -Dapp=master -XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=75 -Djava.net.preferIPv4Stack=true -Xmx128m -Xms128m -classpath /yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000008/app/install/accumulo-1.6.1-SNAPSHOT/conf:/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000008/app/install/accumulo-1.6.1-SNAPSHOT/lib/accumulo-start.jar:/usr/lib/hadoop/lib/log4j-1.2.17.jar -XX:OnOutOfMemoryError=kill -9 %p -XX:-OmitStackTraceInFastThrow -Djavax.xml.parsers.DocumentBuilderFactory=com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderFactoryImpl -Dorg.apache.accumulo.core.home.dir=/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000008/app/install/accumulo-1.6.1-SNAPSHOT -Dhadoop.home.dir=/usr/lib/hadoop -Dzookeeper.home.dir=/usr/lib/zookeeper org.apache.accumulo.start.Main master --address node-1.example.com
> yarn     12594     1  0 08:01 ?        00:00:07 /usr/lib/jvm/java/bin/java -Dapp=master -XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=75 -Djava.net.preferIPv4Stack=true -Xmx128m -Xms128m -classpath /yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000013/app/install/accumulo-1.6.1-SNAPSHOT/conf:/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000013/app/install/accumulo-1.6.1-SNAPSHOT/lib/accumulo-start.jar:/usr/lib/hadoop/lib/log4j-1.2.17.jar -XX:OnOutOfMemoryError=kill -9 %p -XX:-OmitStackTraceInFastThrow -Djavax.xml.parsers.DocumentBuilderFactory=com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderFactoryImpl -Dorg.apache.accumulo.core.home.dir=/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000013/app/install/accumulo-1.6.1-SNAPSHOT -Dhadoop.home.dir=/usr/lib/hadoop -Dzookeeper.home.dir=/usr/lib/zookeeper org.apache.accumulo.start.Main master --address node-1.example.com
> yarn     13006     1  0 08:02 ?        00:00:07 /usr/lib/jvm/java/bin/java -Dapp=master -XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=75 -Djava.net.preferIPv4Stack=true -Xmx128m -Xms128m -classpath /yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000018/app/install/accumulo-1.6.1-SNAPSHOT/conf:/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000018/app/install/accumulo-1.6.1-SNAPSHOT/lib/accumulo-start.jar:/usr/lib/hadoop/lib/log4j-1.2.17.jar -XX:OnOutOfMemoryError=kill -9 %p -XX:-OmitStackTraceInFastThrow -Djavax.xml.parsers.DocumentBuilderFactory=com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderFactoryImpl -Dorg.apache.accumulo.core.home.dir=/yarn/local/usercache/billie/appcache/application_1408460425050_0028/container_1408460425050_0028_01_000018/app/install/accumulo-1.6.1-SNAPSHOT -Dhadoop.home.dir=/usr/lib/hadoop -Dzookeeper.home.dir=/usr/lib/zookeeper org.apache.accumulo.start.Main master --address node-1.example.com



--
This message was sent by Atlassian JIRA
(v6.2#6252)