You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Jungtaek Lim (Jira)" <ji...@apache.org> on 2019/09/20 04:24:00 UTC

[jira] [Commented] (SPARK-29139) Flaky test: org.apache.spark.SparkContextSuite.test gpu driver resource files and discovery under local-cluster mode

    [ https://issues.apache.org/jira/browse/SPARK-29139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16934034#comment-16934034 ] 

Jungtaek Lim commented on SPARK-29139:
--------------------------------------

Looks like the build was super slow at that moment:
|[test driver discovery under local-cluster mode|https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/110759/testReport/org.apache.spark/SparkContextSuite/test_driver_discovery_under_local_cluster_mode]|15 sec|Failed|
|[test gpu driver resource files and discovery under local-cluster mode|https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/110759/testReport/org.apache.spark/SparkContextSuite/test_gpu_driver_resource_files_and_discovery_under_local_cluster_mode]|10 sec|Failed|
|[test resource scheduling under local-cluster mode|https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/110759/testReport/org.apache.spark/SparkContextSuite/test_resource_scheduling_under_local_cluster_mode]|31 sec|Passed|

Though "test resource scheduling under local-cluster mode" was successful, it has been elapsed mostly under 10 secs, even under 20 secs for longest around 5 pages of history.

[https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/110759/testReport/junit/org.apache.spark/SparkContextSuite/test_resource_scheduling_under_local_cluster_mode/history]

Other tests should have pretty higher timeout like it to handle such kind of slowness.

> Flaky test: org.apache.spark.SparkContextSuite.test gpu driver resource files and discovery under local-cluster mode
> --------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-29139
>                 URL: https://issues.apache.org/jira/browse/SPARK-29139
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core, Tests
>    Affects Versions: 3.0.0
>            Reporter: Jungtaek Lim
>            Priority: Major
>
> [https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/110759/testReport/]
> {code:java}
> sbt.ForkMain$ForkError: java.util.concurrent.TimeoutException: Can't find 1 executors before 10000 milliseconds elapsed
> 	at org.apache.spark.TestUtils$.waitUntilExecutorsUp(TestUtils.scala:293)
> 	at org.apache.spark.SparkContextSuite.$anonfun$new$82(SparkContextSuite.scala:793)
> 	at org.apache.spark.SparkContextSuite.$anonfun$new$82$adapted(SparkContextSuite.scala:772)
> 	at org.apache.spark.SparkFunSuite.withTempDir(SparkFunSuite.scala:161)
> 	at org.apache.spark.SparkContextSuite.$anonfun$new$81(SparkContextSuite.scala:772)
> 	at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
> 	at org.scalatest.OutcomeOf.outcomeOf(OutcomeOf.scala:85)
> 	at org.scalatest.OutcomeOf.outcomeOf$(OutcomeOf.scala:83)
> 	at org.scalatest.OutcomeOf$.outcomeOf(OutcomeOf.scala:104)
> 	at org.scalatest.Transformer.apply(Transformer.scala:22)
> 	at org.scalatest.Transformer.apply(Transformer.scala:20)
> 	at org.scalatest.FunSuiteLike$$anon$1.apply(FunSuiteLike.scala:186)
> 	at org.apache.spark.SparkFunSuite.withFixture(SparkFunSuite.scala:149)
> 	at org.scalatest.FunSuiteLike.invokeWithFixture$1(FunSuiteLike.scala:184)
> 	at org.scalatest.FunSuiteLike.$anonfun$runTest$1(FunSuiteLike.scala:196)
> 	at org.scalatest.SuperEngine.runTestImpl(Engine.scala:289)
> 	at org.scalatest.FunSuiteLike.runTest(FunSuiteLike.scala:196)
> 	at org.scalatest.FunSuiteLike.runTest$(FunSuiteLike.scala:178)
> 	at org.apache.spark.SparkFunSuite.org$scalatest$BeforeAndAfterEach$$super$runTest(SparkFunSuite.scala:56)
> 	at org.scalatest.BeforeAndAfterEach.runTest(BeforeAndAfterEach.scala:221)
> 	at org.scalatest.BeforeAndAfterEach.runTest$(BeforeAndAfterEach.scala:214)
> 	at org.apache.spark.SparkFunSuite.runTest(SparkFunSuite.scala:56)
> 	at org.scalatest.FunSuiteLike.$anonfun$runTests$1(FunSuiteLike.scala:229)
> 	at org.scalatest.SuperEngine.$anonfun$runTestsInBranch$1(Engine.scala:396)
> 	at scala.collection.immutable.List.foreach(List.scala:392)
> 	at org.scalatest.SuperEngine.traverseSubNodes$1(Engine.scala:384)
> 	at org.scalatest.SuperEngine.runTestsInBranch(Engine.scala:379)
> 	at org.scalatest.SuperEngine.runTestsImpl(Engine.scala:461)
> 	at org.scalatest.FunSuiteLike.runTests(FunSuiteLike.scala:229)
> 	at org.scalatest.FunSuiteLike.runTests$(FunSuiteLike.scala:228)
> 	at org.scalatest.FunSuite.runTests(FunSuite.scala:1560)
> 	at org.scalatest.Suite.run(Suite.scala:1147)
> 	at org.scalatest.Suite.run$(Suite.scala:1129)
> 	at org.scalatest.FunSuite.org$scalatest$FunSuiteLike$$super$run(FunSuite.scala:1560)
> 	at org.scalatest.FunSuiteLike.$anonfun$run$1(FunSuiteLike.scala:233)
> 	at org.scalatest.SuperEngine.runImpl(Engine.scala:521)
> 	at org.scalatest.FunSuiteLike.run(FunSuiteLike.scala:233)
> 	at org.scalatest.FunSuiteLike.run$(FunSuiteLike.scala:232)
> 	at org.apache.spark.SparkFunSuite.org$scalatest$BeforeAndAfterAll$$super$run(SparkFunSuite.scala:56)
> 	at org.scalatest.BeforeAndAfterAll.liftedTree1$1(BeforeAndAfterAll.scala:213)
> 	at org.scalatest.BeforeAndAfterAll.run(BeforeAndAfterAll.scala:210)
> 	at org.scalatest.BeforeAndAfterAll.run$(BeforeAndAfterAll.scala:208)
> 	at org.apache.spark.SparkFunSuite.run(SparkFunSuite.scala:56)
> 	at org.scalatest.tools.Framework.org$scalatest$tools$Framework$$runSuite(Framework.scala:314)
> 	at org.scalatest.tools.Framework$ScalaTestTask.execute(Framework.scala:507)
> 	at sbt.ForkMain$Run$2.call(ForkMain.java:296)
> 	at sbt.ForkMain$Run$2.call(ForkMain.java:286)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> 	at java.lang.Thread.run(Thread.java:748) {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org