You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@sdap.apache.org by "Joseph Jacob (JIRA)" <ji...@apache.org> on 2018/09/20 21:23:00 UTC

[jira] [Created] (SDAP-151) Determine parallelism automatically for Spark analytics

Joseph Jacob created SDAP-151:
---------------------------------

             Summary: Determine parallelism automatically for Spark analytics
                 Key: SDAP-151
                 URL: https://issues.apache.org/jira/browse/SDAP-151
             Project: Apache Science Data Analytics Platform
          Issue Type: Improvement
            Reporter: Joseph Jacob


Some of the built-in NEXUS analytics like TimeSeries and TimeAvgMap currently get the desired parallelism from a job request parameter like "spark=mesos,16,32".  If that is omitted, we currently default to "spark=local,1,1", which runs on a single core.  Instead we would like to automatically determine the appropriate level of parallelism based on the job's input data size.

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)