You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by ERSyrfw212oe <ER...@protonmail.ch.INVALID> on 2022/10/16 04:36:50 UTC

How to use neo4j cypher/opencypher to query spark RDD/graphdb

I think I saw GraphX here and there,is it a re-implementation of open cypher or is it a graphdb for spark?

I wanted to create a graphdb and query with cypher language,i looked around docs and didnt see any relevant guide.SO seems to be tackling specific problems.and I currently dont even know where to start, or even know if GQL/Cypher query is actually supported or not.

Re: How to use neo4j cypher/opencypher to query spark RDD/graphdb

Posted by Artemis User <ar...@dtechspace.com>.
Spark doesn't offer a native graph database like Neo4j does since GraphX 
is still using the RDD tabular data structure.  Spark doesn't have a GQL 
or Cypher query engine either, but uses Google's Pregal API for graph 
processing.  Don't see any prospect that Spark is going to implement any 
types of GQL engine in the near future.

Implementing a query engine would be a huge task and very difficult to 
optimize in general, especially when dealing with millions if not 
billions of vertices and edges.  Since any graph query would translate 
to APIs, your only option would be to use the graph APIs in Spark 
(currently only in Scala).  It isn't hard to learn and use at all, just 
requires different types of thinking/problem-solving...

On 10/16/22 12:36 AM, ERSyrfw212oe wrote:
> I think I saw GraphX here and there,is it a re-implementation of open 
> cypher or is it a graphdb for spark?
>
> I wanted to create a graphdb and query with cypher language,i looked 
> around docs and didnt see any relevant guide.SO seems to be tackling 
> specific problems.and I currently dont even know where to start, or 
> even know if GQL/Cypher query is actually supported or not.