You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2015/04/20 05:40:58 UTC

[jira] [Commented] (SPARK-7005) resetProb error in pagerank

    [ https://issues.apache.org/jira/browse/SPARK-7005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14502302#comment-14502302 ] 

Sean Owen commented on SPARK-7005:
----------------------------------

I don't think this is a bug. All it does is mean that the sum of all pageranks equals the number of vertices, not 1. But that's entirely valid too. Right?

> resetProb error in pagerank
> ---------------------------
>
>                 Key: SPARK-7005
>                 URL: https://issues.apache.org/jira/browse/SPARK-7005
>             Project: Spark
>          Issue Type: Bug
>          Components: MLlib
>    Affects Versions: 1.3.0
>            Reporter: lisendong
>              Labels: easyfix
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> in the page rank code, the resetProb should be divided by #vertex according to the wikipedia:
> http://en.wikipedia.org/wiki/PageRank
> that is: 
> PR[i] = alpha / N + (1 - alpha) * inNbrs[i].map(j => oldPR[j] / outDeg[j]).sum
> but the code is (org.apache.spark.graphx.lib.PageRank)
> PR[i] = alpha + (1 - alpha) * inNbrs[i].map(j => oldPR[j] / outDeg[j]).sum



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org