You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Hitesh Shah (JIRA)" <ji...@apache.org> on 2015/05/26 19:35:20 UTC
[jira] [Commented] (TEZ-2484) Tez vertex for Hive fails but
Resource Manager reports job succeeded
[ https://issues.apache.org/jira/browse/TEZ-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14559463#comment-14559463 ]
Hitesh Shah commented on TEZ-2484:
----------------------------------
This is related to how hive is using Tez sessions. There is no 1:1 relationship between a yarn application and a Hive query hence the application status cannot be mapped to the failure of one of the queries that ran within a given Tez application on yarn.
> Tez vertex for Hive fails but Resource Manager reports job succeeded
> --------------------------------------------------------------------
>
> Key: TEZ-2484
> URL: https://issues.apache.org/jira/browse/TEZ-2484
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.5.2
> Environment: HDP 2.2.4.2
> Reporter: Hari Sekhon
> Attachments: Tez_RM_misreporting_succeeded.png
>
>
> When running a Hive on Tez job via Hive CLI the job fails and I get the error shown below but in the Resource Manager the job is shown as "Succeeded", even though it's clearly failed:
> {code}
> Status: Running (Executing on YARN cluster with App id application_1432310690008_0103)
> --------------------------------------------------------------------------------
> VERTICES STATUS TOTAL COMPLETED RUNNING PENDING FAILED KILLED
> --------------------------------------------------------------------------------
> Map 1 FAILED 1478 0 0 1478 1 1477
> --------------------------------------------------------------------------------
> VERTICES: 00/01 [>>--------------------------] 0% ELAPSED TIME: 1589.41 s
> --------------------------------------------------------------------------------
> Status: Failed
> Vertex failed, vertexName=Map 1, vertexId=vertex_1432310690008_0103_1_00, diagnostics=[Task failed, taskId=task_1432310690008_0103_1_00_000000, diagnostics=[TaskAttempt 0 failed, info=[ Containercontainer_e122_1432310690008_0103_01_000094 received a STOP_REQUEST]], Vertex failed as one or more tasks failed. failedTasks:1, Vertex vertex_1432310690008_0103_1_00 [Map 1] killed/failed due to:null]
> DAG failed due to vertex failure. failedVertices:1 killedVertices:0
> FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)