You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@systemml.apache.org by "Matthias Boehm (JIRA)" <ji...@apache.org> on 2017/03/10 20:17:04 UTC

[jira] [Closed] (SYSTEMML-1390) Avoid unnecessary caching of parfor spark datapartition-execute input

     [ https://issues.apache.org/jira/browse/SYSTEMML-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matthias Boehm closed SYSTEMML-1390.
------------------------------------

> Avoid unnecessary caching of parfor spark datapartition-execute input
> ---------------------------------------------------------------------
>
>                 Key: SYSTEMML-1390
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1390
>             Project: SystemML
>          Issue Type: Sub-task
>          Components: APIs, Runtime
>            Reporter: Matthias Boehm
>            Assignee: Matthias Boehm
>             Fix For: SystemML 1.0
>
>
> This task aims to avoid unnecessary input caching for parfor spark datapartition-execute jobs (with grouping) in order to reduce the memory pressure and thus garbage collection overhead during shuffle and subsequent execution. We only apply this for the general case with grouping and if the input is a persisted rdd which has not been cached yet.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)