You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by pratik4891 <pr...@gmail.com> on 2019/10/14 08:07:49 UTC

Spark pipeRDD vs ML

What I was wondering after reading about spark pipe RDD is that we can
execute any python code (including machine learning ) . The code is going to
execute in distributed manner as well.

So if we can run machine learning code in distributed manner with pipeRDD
what's the usefulness of Spark ML. Is there anything fundamental difference
between running a python ML code via spark pipeRDD vs Spark ML.



--
Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscribe@spark.apache.org