You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sdap.apache.org by "Joseph Jacob (JIRA)" <ji...@apache.org> on 2018/09/20 21:23:00 UTC
[jira] [Created] (SDAP-151) Determine parallelism automatically for
Spark analytics
Joseph Jacob created SDAP-151:
---------------------------------
Summary: Determine parallelism automatically for Spark analytics
Key: SDAP-151
URL: https://issues.apache.org/jira/browse/SDAP-151
Project: Apache Science Data Analytics Platform
Issue Type: Improvement
Reporter: Joseph Jacob
Some of the built-in NEXUS analytics like TimeSeries and TimeAvgMap currently get the desired parallelism from a job request parameter like "spark=mesos,16,32". If that is omitted, we currently default to "spark=local,1,1", which runs on a single core. Instead we would like to automatically determine the appropriate level of parallelism based on the job's input data size.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)