You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Masayoshi TSUZUKI (JIRA)" <ji...@apache.org> on 2014/11/25 04:02:12 UTC

[jira] [Commented] (SPARK-4539) History Server counts "incomplete" applications against the "retainedApplications" total, fails to show eligible "completed" applications

    [ https://issues.apache.org/jira/browse/SPARK-4539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14223979#comment-14223979 ] 

Masayoshi TSUZUKI commented on SPARK-4539:
------------------------------------------

I assume you mean the parameter "spark.history.retainedApplications".
It is not the value of limit the number of apps listed on the HistoryServer UI, but the number of caches of the application detail info which is shown when we click the listed "App ID" link.
Even when spark.history.retainedApplications is set as 2, we can see more than 10 apps listed.

By the way, I think your operation doesn't work properly.
As you know, just copying some existing application directory doesn't work because they both have same application id in EVENT_LOG_1 so it is needed to be modified.
If there are 2 app directories which have the same application id, HistoryServer skips listing.
And HistoryServer read only the directory whose modification time is later than the log directory was loaded last time.
So please try
  * update the modification time of the directory after you modified EVENT_LOG_1.
  * make sure you don't see the browser cache.

It works for me.
And of course, restarting HistoryServer is also a good idea to get all apps listed.


> History Server counts "incomplete" applications against the "retainedApplications" total, fails to show eligible "completed" applications
> -----------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-4539
>                 URL: https://issues.apache.org/jira/browse/SPARK-4539
>             Project: Spark
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Ryan Williams
>
> I have observed the history server to return 0 or 1 applications from a directory that contains many complete and incomplete applications (the latter being application directories that are missing the {{APPLICATION_COMPLETE}} file).
> Without having dug too much, my theory is that HistoryServer is seeing the "incomplete" directories and counting them against the {{retainedApplications}} maximum but not displaying them.
> One supporting anecdote for this is that I loaded HS against a directory that had one complete application and nothing else, and HS worked as expected (I saw the one application in the web UI).
> I then copied ~100 other application directories in, the majority of which were "incomplete" (in particular, most of the ones that had the earliest timestamps), and still only saw the one original completed application via the web UI.
> Finally, I restarted the same server with the {{retainedApplications}} set to 1000 (instead of 50; the directory a this point had ~10 completed applications and 90 incomplete ones), and saw all/exactly the completed applications, leading me to believe that they were being "boxed out" of the maximum-50-retained-applications iteration of the history server.
> Silently failing on "incomplete" directories while still docking the count, if that is indeed what is happening, is a pretty confusing failure mode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org