Posted to issues@spark.apache.org by "Andrew Or (JIRA)" <ji...@apache.org> on 2015/05/19 01:05:59 UTC

[jira] [Updated] (SPARK-7716) SparkUI stage page hangs with many tasks

     [ https://issues.apache.org/jira/browse/SPARK-7716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Or updated SPARK-7716:
-----------------------------
    Description: 
While running the following, I checked the stage page on the SparkUI:
{code}
sc.parallelize(1 to 5000, 10000).count()
{code}

Then I get
{code}
HTTP ERROR 500

Problem accessing /stages/stage/. Reason:

    Server Error
Caused by:

java.lang.OutOfMemoryError: Java heap space
	at java.util.Arrays.copyOf(Arrays.java:2367)
	at java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:130)
	at java.lang.AbstractStringBuilder.ensureCapacityInternal(AbstractStringBuilder.java:114)
	at java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:587)
	at java.lang.StringBuilder.append(StringBuilder.java:214)
{code}

This happens because we end up concatenating all the Scala XML nodes into raw strings and shipping them to the UI through Jetty. The correct long-term fix is to add pagination, but even adding a compression layer will fix this for most cases.
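
As a rough illustration of the pagination idea, the sketch below renders only one page of task rows at a time, so the string that is built stays bounded by the page size rather than by the total task count. Everything here is hypothetical: PagedTaskTable, TaskRow, pageOf, and render are illustrative names rather than Spark's actual UI classes, and the HTML is a bare stand-in for the real stage page markup.

```scala
// Hypothetical sketch: none of these names come from Spark's UI code.
object PagedTaskTable {
  // Stand-in for one row of task data shown on the stage page.
  final case class TaskRow(index: Int, durationMs: Long)

  /** Returns the rows of a 1-based page; empty when the page is past the end. */
  def pageOf[A](rows: IndexedSeq[A], page: Int, pageSize: Int): IndexedSeq[A] = {
    require(page >= 1, "page must be >= 1")
    require(pageSize >= 1, "pageSize must be >= 1")
    // Compute bounds in Long to avoid Int overflow, then clamp to the row count.
    val from = math.min((page.toLong - 1) * pageSize, rows.length.toLong).toInt
    val until = math.min(from.toLong + pageSize, rows.length.toLong).toInt
    rows.slice(from, until)
  }

  /** Renders only the given rows, so the output size scales with the page size. */
  def render(rows: Seq[TaskRow]): String =
    rows
      .map(r => s"<tr><td>${r.index}</td><td>${r.durationMs}</td></tr>")
      .mkString("<table>", "", "</table>")
}
```

With 10000 tasks and a page size of 100, PagedTaskTable.render(PagedTaskTable.pageOf(tasks, 1, 100)) would build markup for 100 rows instead of all 10000.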


  was:
While running the following, I checked the stage page on the SparkUI:
{code}
sc.parallelize(1 to 5000, 100000).count()
{code}

Then I get
{code}
HTTP ERROR 500

Problem accessing /stages/stage/. Reason:

    Server Error
Caused by:

java.lang.OutOfMemoryError: Java heap space
	at java.util.Arrays.copyOf(Arrays.java:2367)
	at java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:130)
	at java.lang.AbstractStringBuilder.ensureCapacityInternal(AbstractStringBuilder.java:114)
	at java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:587)
	at java.lang.StringBuilder.append(StringBuilder.java:214)
{code}

This happens because we end up concatenating all the Scala XML nodes into raw strings and shipping them to the UI through Jetty. The correct long-term fix is to add pagination, but even adding a compression layer will fix this for most cases.



> SparkUI stage page hangs with many tasks
> ----------------------------------------
>
>                 Key: SPARK-7716
>                 URL: https://issues.apache.org/jira/browse/SPARK-7716
>             Project: Spark
>          Issue Type: Bug
>          Components: Web UI
>    Affects Versions: 1.4.0
>            Reporter: Andrew Or
>            Assignee: Andrew Or
>            Priority: Critical
>
> While running the following, I checked the stage page on the SparkUI:
> {code}
> sc.parallelize(1 to 5000, 10000).count()
> {code}
> Then I get
> {code}
> HTTP ERROR 500
> Problem accessing /stages/stage/. Reason:
>     Server Error
> Caused by:
> java.lang.OutOfMemoryError: Java heap space
> 	at java.util.Arrays.copyOf(Arrays.java:2367)
> 	at java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:130)
> 	at java.lang.AbstractStringBuilder.ensureCapacityInternal(AbstractStringBuilder.java:114)
> 	at java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:587)
> 	at java.lang.StringBuilder.append(StringBuilder.java:214)
> {code}
> This happens because we end up concatenating all the Scala XML nodes into raw strings and shipping them to the UI through Jetty. The correct long-term fix is to add pagination, but even adding a compression layer will fix this for most cases.


