You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Denny Lee (JIRA)" <ji...@apache.org> on 2016/11/01 15:35:58 UTC

[jira] [Created] (SPARK-18200) GraphX Invalid initial capacity when running triangleCount

Denny Lee created SPARK-18200:
---------------------------------

             Summary: GraphX Invalid initial capacity when running triangleCount
                 Key: SPARK-18200
                 URL: https://issues.apache.org/jira/browse/SPARK-18200
             Project: Spark
          Issue Type: Bug
          Components: GraphX
    Affects Versions: 2.0.1, 2.0.0, 2.0.2
         Environment: Databricks, Ubuntu 16.04, macOS Sierra
            Reporter: Denny Lee


Running GraphX triangle count on large-ish file results in the "Invalid initial capacity" error when running on Spark 2.0 (tested on Spark 2.0, 2.0.1, and 2.0.2).  You can see the results at: http://bit.ly/2eQKWDN

Running the same code on Spark 1.6 and the query completes without any problems: http://bit.ly/2fATO1M

As well, running the GraphFrames version of this code runs as well (Spark 2.0, GraphFrames 0.2): http://bit.ly/2fAS8W8

Reference Stackoverflow question:
Spark GraphX: requirement failed: Invalid initial capacity (http://stackoverflow.com/questions/40337366/spark-graphx-requirement-failed-invalid-initial-capacity)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org