You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Larry Xiao (JIRA)" <ji...@apache.org> on 2014/08/05 04:16:13 UTC

[jira] [Commented] (SPARK-1153) Generalize VertexId in GraphX so that UUIDs can be used as vertex IDs.

    [ https://issues.apache.org/jira/browse/SPARK-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14085676#comment-14085676 ] 

Larry Xiao commented on SPARK-1153:
-----------------------------------

I like npanj's approach.
It's universal. You treat UUID as attribute.

Like the procedure from http://spark.apache.org/docs/latest/graphx-programming-guide.html

// Connect to the Spark cluster
== Build Graph (build VertexID if necessary)
// Load my user data and parse into tuples of user id and attribute list
// Parse the edge data which is already in userId -> userId format
// Attach the user attributes
== Clean Graph
// Some users may not have attributes so we set them as empty, Restrict the graph to users with usernames and names
== Compute
// Compute the PageRank
== Get Result
// Get the attributes of the top pagerank users

> Generalize VertexId in GraphX so that UUIDs can be used as vertex IDs.
> ----------------------------------------------------------------------
>
>                 Key: SPARK-1153
>                 URL: https://issues.apache.org/jira/browse/SPARK-1153
>             Project: Spark
>          Issue Type: Improvement
>          Components: GraphX
>    Affects Versions: 0.9.0
>            Reporter: Deepak Nulu
>
> Currently, {{VertexId}} is a type-synonym for {{Long}}. I would like to be able to use {{UUID}} as the vertex ID type because the data I want to process with GraphX uses that type for its primay-keys. Others might have a different type for their primary-keys. Generalizing {{VertexId}} (with a type class) will help in such cases.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org