You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by mshiryae <mi...@intel.com> on 2016/07/05 11:07:57 UTC
Spark MLlib: network intensive algorithms
Hi,
I have a question wrt ML algorithms.
What are the most network intensive algorithms in Spark MLlib?
I have already looked at ALS (as pointed here:
https://databricks.com/blog/2014/07/23/scalable-collaborative-filtering-with-spark-mllib.html
ALS is pretty communication and computation intensive but in the latest
Spark ALS is pretty optimized in terms of communications).
As far as I understand ML algorithm do many computations on node and
periodically perform small exchanges between nodes.
May be you know such examples when algorithm is network-bounded or when
network exchanges consume noticeable time in training time?
--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-MLlib-network-intensive-algorithms-tp27287.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscribe@spark.apache.org