You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@ambari.apache.org by Anandha L Ranganathan <an...@gmail.com> on 2016/06/17 19:10:40 UTC

HDP 2.4 upgrade using ambari - service check failed for Tez at 82%

I am upgrading one of our cluster from HDP 2.2 to HDP 2.4.0. version.

80% of upgrade is completed and all the core, slave,hive, spark are
upgraded to latest version of HDP 2.4 . During the service check phase, Tez
component is failed.

The status I see in the Application monitoring URL is

YARN Applicaiton Status: ACCEPTED: waiting for AM container to be
allocated, launched and register with RM.

It waits in that state for sometime(300 seconds) and dies and the service
checke is failed.



All nodes are live and Active status.

Also we are able to run MR job for the user ambari-qa , but it is failing
for Tez job.

We try to run the job manually , and the job stops at this point.

hadoop --config /usr/hdp/2.4.0.0-169/hadoop/conf jar
/usr/hdp/current/tez-client/tez-examples*.jar orderedwordcount
/tmp/tezsmokeinput/sample-tez-test /tmp/tezsmokeoutput1/
WARNING: Use "yarn jar" to launch YARN applications.
16/06/17 19:04:47 INFO client.TezClient: Tez Client Version: [
component=tez-api, version=0.7.0.2.4.0.0-169,
revision=3c1431f45faaca982ecc8dad13a107787b834696,
SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git,
buildTime=20160210-0711 ]
16/06/17 19:04:47 INFO impl.TimelineClientImpl: Timeline service
address: http://usw2stdpma03.glassdoor.local:8188/ws/v1/timeline/
16/06/17 19:04:48 INFO client.RMProxy: Connecting to ResourceManager
at usw2stdpma03.glassdoor.local/172.17.212.107:8050
16/06/17 19:04:48 INFO client.TezClient: Using
org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager to
manage Timeline ACLs
16/06/17 19:04:48 INFO impl.TimelineClientImpl: Timeline service
address: http://usw2stdpma03.glassdoor.local:8188/ws/v1/timeline/
16/06/17 19:04:49 INFO examples.OrderedWordCount: Running OrderedWordCount
16/06/17 19:04:49 INFO client.TezClient: Submitting DAG application
with id: application_1466115469995_0142
16/06/17 19:04:49 INFO client.TezClientUtils: Using tez.lib.uris value
from configuration: /hdp/apps/2.4.0.0-169/tez/tez.tar.gz
16/06/17 19:04:49 INFO client.TezClient: Stage directory
/tmp/root/staging doesn't exist and is created
16/06/17 19:04:49 INFO client.TezClient: Tez system stage directory
hdfs://dfs-nameservices/tmp/root/staging/.tez/application_1466115469995_0142
doesn't exist and is created
16/06/17 19:04:49 INFO acls.ATSHistoryACLPolicyManager: Created
Timeline Domain for History ACLs,
domainId=Tez_ATS_application_1466115469995_0142
16/06/17 19:04:50 INFO client.TezClient: Submitting DAG to YARN,
applicationId=application_1466115469995_0142,
dagName=OrderedWordCount, callerContext={ context=TezExamples,
callerType=null, callerId=null }
16/06/17 19:04:50 INFO impl.YarnClientImpl: Submitted application
application_1466115469995_0142
16/06/17 19:04:50 INFO client.TezClient: The url to track the Tez AM:
http://usw2stdpma03.glassdoor.local:8088/proxy/application_1466115469995_0142/
16/06/17 19:04:50 INFO impl.TimelineClientImpl: Timeline service
address: http://usw2stdpma03.glassdoor.local:8188/ws/v1/timeline/
16/06/17 19:04:50 INFO client.RMProxy: Connecting to ResourceManager
at usw2stdpma03.glassdoor.local/172.17.212.107:8050
16/06/17 19:04:51 INFO client.DAGClientImpl: Waiting for DAG to start running



how do I fix this problem ?

Thanks
Anand