You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Kay Ousterhout (JIRA)" <ji...@apache.org> on 2015/07/31 06:12:04 UTC
[jira] [Commented] (SPARK-9497) Flaky test: DistributedSuite failed
after the test of "repeatedly failing task that crashes JVM"
[ https://issues.apache.org/jira/browse/SPARK-9497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14648721#comment-14648721 ]
Kay Ousterhout commented on SPARK-9497:
---------------------------------------
When did you start seeing these failures? I added the test 1.5 years ago, so it seems likely to be related to a recent change in the code.
It looks like this error is happening when we try to stop the AppClient from SparkDeploySchedulerBackend: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/cluster/SparkDeploySchedulerBackend.scala#L96; is it possible this started happening after [~zsxwing]'s recent change? https://github.com/apache/spark/commit/3bee0f1466ddd69f26e95297b5e0d2398b6c6268 (that changed the line of code in AppClient.scala that is failing: https://github.com/apache/spark/blame/master/core/src/main/scala/org/apache/spark/deploy/client/AppClient.scala#L251)
> Flaky test: DistributedSuite failed after the test of "repeatedly failing task that crashes JVM"
> ------------------------------------------------------------------------------------------------
>
> Key: SPARK-9497
> URL: https://issues.apache.org/jira/browse/SPARK-9497
> Project: Spark
> Issue Type: Bug
> Components: Tests
> Reporter: Yin Huai
> Labels: flaky-test
>
> Seems it is pretty often to see DistributedSuite failed right after "repeatedly failing task that crashes JVM".
> One example jenkins can be found at
> https://amplab.cs.berkeley.edu/jenkins/job/Spark-Master-SBT/3117/AMPLAB_JENKINS_BUILD_PROFILE=hadoop1.0,label=centos/testReport/junit/org.apache.spark/DistributedSuite/
> The log of it can be found at https://amplab.cs.berkeley.edu/jenkins/job/Spark-Master-SBT/3117/AMPLAB_JENKINS_BUILD_PROFILE=hadoop1.0,label=centos/artifact/core/target/unit-tests.log (search StopAppClient).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org