You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "RJ Nowling (JIRA)" <ji...@apache.org> on 2014/08/27 22:26:58 UTC
[jira] [Created] (SPARK-3263) PR #720 broke
GraphGenerator.logNormal
RJ Nowling created SPARK-3263:
---------------------------------
Summary: PR #720 broke GraphGenerator.logNormal
Key: SPARK-3263
URL: https://issues.apache.org/jira/browse/SPARK-3263
Project: Spark
Issue Type: Bug
Components: GraphX
Reporter: RJ Nowling
PR #720 made multiple changes to GraphGenerator.logNormalGraph including:
* Replacing the call to functions for generating random vertices and edges with in-line implementations with different equations
* Hard-coding of RNG seeds so that method now generates the same graph for a given number of vertices, edges, mu, and sigma -- user is not able to override seed or specify that seed should be randomly generated.
* Backwards-incompatible change to logNormalGraph signature with introduction of new required parameter.
* Failed to update scala docs and programming guide for API changes
I also see that PR #720 added a Synthetic Benchmark in the examples.
Based on reading the Pregel paper, I believe the in-line functions are incorrect. I proposed to:
* Removing the in-line calls
* Adding a seed for deterministic behavior (when desired)
* Keeping the number of partitions parameter.
* Updating the synthetic benchmark example
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org