You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Rares Vernica (JIRA)" <ji...@apache.org> on 2015/08/24 20:46:45 UTC
[jira] [Created] (SPARK-10187) Sometimes Web UI reports Application
history not found
Rares Vernica created SPARK-10187:
-------------------------------------
Summary: Sometimes Web UI reports Application history not found
Key: SPARK-10187
URL: https://issues.apache.org/jira/browse/SPARK-10187
Project: Spark
Issue Type: Bug
Components: Web UI
Affects Versions: 1.4.1
Environment: CentOS Linux release 7.1.1503 (Core)
Linux 3.10.0-229.4.2.el7.x86_64 #1 SMP Wed May 13 10:06:09 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
Reporter: Rares Vernica
Priority: Minor
On the Web UI home page {{http://localhost:8080/}} in the list of Completed Applications clicking on the Name of a job shows the history of that job. This works for most of the time. For some jobs, Spark returns:
{quote}
Application history not found (app-20150824104953-0018)
No event logs found for application bid-00001-ph-1 in file:///data/disk1/spark/events. Did you specify the correct logging directory?
{quote}
All these jobs have the FINISHED state. {{spark-defaults.conf}} contains
{quote}
spark.eventLog.dir file:///data/disk1/spark/events
spark.history.fs.logDirectory file:///data/disk1/spark/events
{quote}
{{/data/disk1/spark/events}} is a normal directory on the disk. This works fine for some jobs but not others (even jobs on the same Spark session).
The Spark History UI shows correctly the history of the jobs that the Web UI cannot find the logs for. So, I believe the log directory is fine.
Here is the HTML snippet from the Web UI home page, under the Completed Applications section. Notice how the UI finds the history for some jobs, but not others:
{quote}
...
<a href="app?appId=app-20150824110804-0019">app-20150824110804-0019</a>
...
<a href="/history/app-20150824110804-0019">bid-00001-ph-2</a>
...
<a href="app?appId=app-20150824104953-0018">app-20150824104953-0018</a>
...
<a href="/history/not-found?msg=No+event+logs+found+for+application+bid-00001-ph-1+in+file%3A%2F%2F%2Fdata%2Fdisk1%2Fspark%2Fevents.+Did+you+specify+the+correct+logging+directory%3F&title=Application history not found (app-20150824104953-0018)">bid-00001-ph-1</a>
...
<a href="app?appId=app-20150824104907-0017">app-20150824104907-0017</a>
...
<a href="/history/not-found?msg=No+event+logs+found+for+application+bid-00001-ph-2+in+file%3A%2F%2F%2Fdata%2Fdisk1%2Fspark%2Fevents.+Did+you+specify+the+correct+logging+directory%3F&title=Application history not found (app-20150824104907-0017)">bid-00001-ph-2</a>
...
<a href="app?appId=app-20150824103131-0016">app-20150824103131-0016</a>
...
<a href="/history/app-20150824103131-0016">bid-00001-ph-1</a>
{quote}
These are sequential jobs and all the necessary files and directories in {{/data/disk1/spark/events}} and present and accessible and Spark History UI shows the history for all these jobs.
I looked around JIRA for similar issues, I think this is related to SPARK-6107 and SPARK-6950 but not a duplicate. The jobs for which these manifests are all FINISHED for some time.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org