You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by mshiryae <mi...@intel.com> on 2016/07/05 11:07:57 UTC

Spark MLlib: network intensive algorithms

Hi,

I have a question wrt ML algorithms.
What are the most network intensive algorithms in Spark MLlib?

I have already looked at ALS (as pointed here:
https://databricks.com/blog/2014/07/23/scalable-collaborative-filtering-with-spark-mllib.html
ALS is pretty communication and computation intensive but in the latest
Spark ALS is pretty optimized in terms of communications).

As far as I understand ML algorithm do many computations on node and
periodically perform small exchanges between nodes.

May be you know such examples when algorithm is network-bounded or when
network exchanges consume noticeable time in training time?



--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-MLlib-network-intensive-algorithms-tp27287.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscribe@spark.apache.org