You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "bonnal-enzo (Jira)" <ji...@apache.org> on 2021/05/09 17:48:00 UTC

[jira] [Commented] (SPARK-35357) Allow to turn off the normalization applied by static PageRank utilities

    [ https://issues.apache.org/jira/browse/SPARK-35357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17341569#comment-17341569 ] 

bonnal-enzo commented on SPARK-35357:
-------------------------------------

Here is a pull request https://github.com/apache/spark/pull/32485

> Allow to turn off the normalization applied by static PageRank utilities
> ------------------------------------------------------------------------
>
>                 Key: SPARK-35357
>                 URL: https://issues.apache.org/jira/browse/SPARK-35357
>             Project: Spark
>          Issue Type: Improvement
>          Components: GraphX
>    Affects Versions: 3.1.1
>            Reporter: bonnal-enzo
>            Priority: Minor
>
> Since SPARK-18847, static PageRank computations available in `PageRank.scala` are normalizing the sum of the ranks after the fixed number of iterations has completed, and *there is no way for a developer to access the raw non normalized ranks values*.
> Since SPARK-29877 one can run a fixed number of PageRank iterations starting from previous `preRankGraph`'s ranks.
>  This nice feature open the door for interesting *incremental algorithms*, for example:
>  "Run some initial pagerank iterations using `PageRank.runWithOptions` and then update the graph's edges and update the ranks with a call to `PageRank.runWithOptionsWithPreviousPageRank`, and so on...".
> This kind of algorithms would highly benefit (precision gain) from being allowed to manipulate directly the raw ranks (and not the normalized ones) in the case where the graph has a substantial proportion of sinks (vertices without outgoing edges).
> It would be nice to add a method's signature having a boolean that allows to turn off the automatic normalization run at the end of `PageRank.runWithOptions` and `PageRank.runWithOptionsWithPreviousPageRank`, making the developers free to apply the normalization only when they really need it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org