You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/06/20 04:11:00 UTC

[jira] [Commented] (SPARK-28080) There is a problem to download and watch offline the history of an application with multiple attempts due to UI inconsistency

    [ https://issues.apache.org/jira/browse/SPARK-28080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16868247#comment-16868247 ] 

Hyukjin Kwon commented on SPARK-28080:
--------------------------------------

Can you attach screenshots of UI?

> There is a problem to download and watch offline the history of an application with multiple attempts due to UI inconsistency
> -----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-28080
>                 URL: https://issues.apache.org/jira/browse/SPARK-28080
>             Project: Spark
>          Issue Type: Bug
>          Components: Web UI
>    Affects Versions: 2.3.1
>         Environment: I used the spark-2.4.3-bin-hadoop2.7 and spark-2.3.1-bin-hadoop2.7 packages from [https://spark.apache.org/downloads.html]
> Running the history server locally as-is (using default values) on ubunto 16.04.4 running using WSL (Windows Subsystem for Linux) on my windows 10 machine.
> Browser used is firefox 67.0.2 (64-bit) for windows
>            Reporter: Gal Weiss
>            Priority: Minor
>              Labels: history, historyserver, ui
>
> Overview:
> If you are looking to watch locally a spark application attempt history, trying to see the history of the first attempt (or any attempt but the last one) would fail, because some UI inconsistently.
> The inconsistency is that in the spark history UI, the "app_id" column is clickable and will always take you to this application *last attempt*, but if you tried to download only the first attempt, you will get an error of application not found.
>  
> How to reproduce:
>  # open spark any spark history server (if using Azure HDinsight the address would be https://<cluster name>.azurehdinsight.net/sparkhistory/)
>  # look for an application that have multiple attempts (ie - attempt ID > 1)
>  # look for the *first* attempt in this application and download it using the "download" button in the event column. save it in your local spark history folder (default: /tmp/spark-events)
>  # Start a local spark history server (typically: using the start-history-server.sh script)
>  # browse to the local history server and look for the application for which you downloaded the history.
>  # click the application name in the "App ID" column, and you would get the following error:
> "Application <your application ID> not found."
> Why ?
> because on the remote history server it is assumed that all Attempts history files are preset, so the "App ID" column points to the latest attempt of this app, while the "Attempt  ID" column points to the specific attempt.
> But if we have an application with two attempts, and we only want to research the first one, we download it locally, opening with the local history server, and intuitively clicking the link in the "app id" column, the link actually points to the second attempt, which we haven't even downloaded. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org