Posted to issues@spark.apache.org by "Andrew Ash (JIRA)" <ji...@apache.org> on 2014/11/12 06:52:34 UTC

[jira] [Commented] (SPARK-689) Task will crash when setting SPARK_WORKER_CORES> 128

    [ https://issues.apache.org/jira/browse/SPARK-689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207721#comment-14207721 ] 

Andrew Ash commented on SPARK-689:
----------------------------------

I attempted a repro on a one-node cluster (my laptop) and confirmed that this bug no longer exists.  Code inspection shows that the Executor's threadpool no longer has the 128-thread limit that was hit at this stacktrace line: {{at spark.executor.Executor.launchTask(Executor.scala:59)}}
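For anyone wondering how a thread cap produces the rejection in the original report, here is a minimal standalone sketch in plain Java. It is not Spark's code. The pool parameters (one core thread, a maximum of 128, a {{SynchronousQueue}}) are my assumption, chosen only because they match the reported state "pool size = 128, active threads = 128, queued tasks = 0". The class and method names ({{PoolLimitDemo}}, {{countRejected}}, {{boundedPool}}) are made up for illustration.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.SynchronousQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

public class PoolLimitDemo {

    /** Submits `tasks` tasks that all block until released; returns how many were rejected. */
    public static int countRejected(ExecutorService pool, int tasks) {
        CountDownLatch release = new CountDownLatch(1);
        int rejected = 0;
        for (int i = 0; i < tasks; i++) {
            try {
                pool.execute(() -> {
                    try {
                        release.await(); // keep the thread busy, like a long-running task
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                });
            } catch (RejectedExecutionException e) {
                rejected++; // the default AbortPolicy throws once the pool is saturated
            }
        }
        release.countDown(); // let the accepted tasks finish
        pool.shutdown();
        try {
            pool.awaitTermination(30, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return rejected;
    }

    /** Assumed shape of a capped pool: no task queueing, at most `maxThreads` threads. */
    public static ExecutorService boundedPool(int maxThreads) {
        return new ThreadPoolExecutor(
                1, maxThreads, 60L, TimeUnit.SECONDS, new SynchronousQueue<Runnable>());
    }

    public static void main(String[] args) {
        System.out.println("capped at 128: rejected "
                + countRejected(boundedPool(128), 200));
        System.out.println("cached pool:   rejected "
                + countRejected(Executors.newCachedThreadPool(), 200));
    }
}
```

With 200 concurrent blocking tasks, the capped pool accepts 128 and rejects the other 72, while an unbounded cached pool rejects none. That is the behavior difference this ticket is about, independent of how Spark constructs its pool today.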

Here's the outline of my repro attempt:

{noformat}
aash@aash-mbp ~/git/spark$ cat conf/spark-env.sh
SPARK_WORKER_CORES=200
SPARK_MASTER_IP=aash-mbp.local
SPARK_PUBLIC_DNS=aash-mbp.local
aash@aash-mbp ~/git/spark$ cat conf/spark-defaults.sh
spark.master                     spark://aash-mbp.local:7077
aash@aash-mbp ~/git/spark$ sbin/start-all.sh
...
aash@aash-mbp ~/git/spark$ bin/spark-shell
spark> sc.parallelize(1l to 100000000l,200).reduce(_+_)
res0: Long = 5000000050000000
spark>
{noformat}
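As a sanity check on the result above, the sum of 1 through 100000000 can be verified outside Spark. The sketch below compares the closed form n(n+1)/2 with a sum computed over 200 contiguous slices. The slicing is only a rough stand-in for splitting the range into 200 partitions and is not Spark's partitioning code. {{SumCheck}}, {{closedForm}}, and {{slicedSum}} are hypothetical names.

```java
import java.util.stream.LongStream;

public class SumCheck {

    /** Closed form for 1 + 2 + ... + n. */
    public static long closedForm(long n) {
        return n * (n + 1) / 2;
    }

    /** Sums 1..n as `slices` contiguous chunks, then adds up the partial sums. */
    public static long slicedSum(long n, int slices) {
        long total = 0;
        for (int i = 0; i < slices; i++) {
            long start = n * i / slices + 1;     // first element of slice i
            long end = n * (i + 1) / slices;     // last element of slice i
            total += LongStream.rangeClosed(start, end).sum();
        }
        return total;
    }

    public static void main(String[] args) {
        long n = 100_000_000L;
        System.out.println(closedForm(n));      // prints 5000000050000000
        System.out.println(slicedSum(n, 200));  // same value, computed slice by slice
    }
}
```

Both values match the {{res0}} shown in the shell session, so all 200 slices contributed to the result.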

I'm closing this ticket now. [~xiajunluan], please reopen it if you're still having issues.

> Task will crash when setting SPARK_WORKER_CORES> 128
> ----------------------------------------------------
>
>                 Key: SPARK-689
>                 URL: https://issues.apache.org/jira/browse/SPARK-689
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 0.6.1
>            Reporter: xiajunluan
>
> When I set SPARK_WORKER_CORES > 128 (for example, 200) and run a job in standalone mode that allocates 200 tasks on one worker node, the tasks crash (it seems that the worker core limit has been hard-coded).
> {noformat}
> 13/02/07 11:25:02 ERROR StandaloneExecutorBackend: Task spark.executor.Executor$TaskRunner@5367839e rejected from java.util.concurrent.ThreadPoolExecutor@30f224d9[Running, pool size = 128, active threads = 128, queued tasks = 0, completed tasks = 0]
> java.util.concurrent.RejectedExecutionException: Task spark.executor.Executor$TaskRunner@5367839e rejected from java.util.concurrent.ThreadPoolExecutor@30f224d9[Running, pool size = 128, active threads = 128, queued tasks = 0, completed tasks = 0]
> 	at java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:2013)
> 	at java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:816)
> 	at java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:1337)
> 	at spark.executor.Executor.launchTask(Executor.scala:59)
> 	at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:57)
> 	at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:46)
> 	at akka.actor.Actor$class.apply(Actor.scala:318)
> 	at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:17)
> 	at akka.actor.ActorCell.invoke(ActorCell.scala:626)
> 	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197)
> 	at akka.dispatch.Mailbox.run(Mailbox.scala:179)
> 	at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516)
> 	at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259)
> 	at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975)
> 	at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
> 	at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
> 13/02/07 11:25:02 INFO StandaloneExecutorBackend: Connecting to master: akka://spark@10.0.2.19:60882/user/StandaloneScheduler
> 13/02/07 11:25:02 INFO StandaloneExecutorBackend: Got assigned task 1929
> 13/02/07 11:25:02 INFO Executor: launch taskId: 1929
> 13/02/07 11:25:02 ERROR StandaloneExecutorBackend: 
> java.lang.NullPointerException
> 	at spark.executor.Executor.launchTask(Executor.scala:59)
> 	at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:57)
> 	at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:46)
> 	at akka.actor.Actor$class.apply(Actor.scala:318)
> 	at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:17)
> 	at akka.actor.ActorCell.invoke(ActorCell.scala:626)
> 	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197)
> 	at akka.dispatch.Mailbox.run(Mailbox.scala:179)
> 	at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516)
> 	at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259)
> 	at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975)
> 	at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
> 	at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
> 13/02/07 11:25:02 INFO StandaloneExecutorBackend: Connecting to master: akka://spark@10.0.2.19:60882/user/StandaloneScheduler
> 13/02/07 11:25:02 INFO StandaloneExecutorBackend: Got assigned task 1930
> 13/02/07 11:25:02 INFO Executor: launch taskId: 1930
> 13/02/07 11:25:02 ERROR StandaloneExecutorBackend: 
> java.lang.NullPointerException
> 	at spark.executor.Executor.launchTask(Executor.scala:59)
> 	at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:57)
> 	at spark.executor.StandaloneExecutorBackend$$anonfun$receive$1.apply(StandaloneExecutorBackend.scala:46)
> 	at akka.actor.Actor$class.apply(Actor.scala:318)
> 	at spark.executor.StandaloneExecutorBackend.apply(StandaloneExecutorBackend.scala:17)
> 	at akka.actor.ActorCell.invoke(ActorCell.scala:626)
> 	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197)
> 	at akka.dispatch.Mailbox.run(Mailbox.scala:179)
> 	at akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516)
> 	at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259)
> 	at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975)
> 	at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
> 	at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
> {noformat}


