You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Imran Rashid (JIRA)" <ji...@apache.org> on 2018/06/13 03:19:00 UTC

[jira] [Created] (SPARK-24539) HistoryServer does not display metrics from tasks that complete after stage failure

Imran Rashid created SPARK-24539:
------------------------------------

             Summary: HistoryServer does not display metrics from tasks that complete after stage failure
                 Key: SPARK-24539
                 URL: https://issues.apache.org/jira/browse/SPARK-24539
             Project: Spark
          Issue Type: Bug
          Components: Web UI
    Affects Versions: 2.3.1
            Reporter: Imran Rashid


I noticed that task metrics for completed tasks with a stage failure do not show up in the new history server.  I have a feeling this is because all of the tasks succeeded *after* the stage had been failed (so they were completions from a "zombie" taskset).  The task metrics (eg. the shuffle read size & shuffle write size) do not show up at all, either in the task table, the executor table, or the overall stage summary metrics.  (they might not show up in the job summary page either, but in the event logs I have, there is another successful stage attempt after this one, and that is the only thing which shows up in the jobs page.)  If you get task details from the api endpoint (eg. http://[host]:[port]/api/v1/applications/[app-id]/stages/[stage-id]/[stage-attempt]) then you can see the successful tasks and all the metrics

Unfortunately the event logs I have are huge and I don't have a small repro handy, but I hope that description is enough to go on.

I loaded the event logs I have in the SHS from spark 2.2 and they appear fine.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org