You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Greg Hogan (JIRA)" <ji...@apache.org> on 2017/07/11 14:15:01 UTC
[jira] [Closed] (FLINK-7019) Rework parallelism in Gelly algorithms
and examples
[ https://issues.apache.org/jira/browse/FLINK-7019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Greg Hogan closed FLINK-7019.
-----------------------------
Resolution: Implemented
master: d0cc2c178714987ba23998486651791d04a5beb1
> Rework parallelism in Gelly algorithms and examples
> ---------------------------------------------------
>
> Key: FLINK-7019
> URL: https://issues.apache.org/jira/browse/FLINK-7019
> Project: Flink
> Issue Type: Sub-task
> Components: Gelly
> Affects Versions: 1.4.0
> Reporter: Greg Hogan
> Assignee: Greg Hogan
> Priority: Minor
> Fix For: 1.4.0
>
>
> Flink job parallelism is set with {{ExecutionConfig#setParallelism}} or when {{-p}} on the command-line. The Gelly algorithms {{JaccardIndex}}, {{AdamicAdar}}, {{TriangleListing}}, and {{ClusteringCoefficient}} have intermediate operators which generate output quadratic in the size of input. These algorithms may need to be run with a high parallelism but doing so for all operations is wasteful. Thus was introduced "little parallelism".
> This can be simplified by moving the parallelism parameter to the new common base class with the rule-of-thumb to use the algorithm parallelism for all normal (small output) operators. The asymptotically large operators will default to the job parallelism, as will the default algorithm parallelism.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)