You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by "Sela, Amit" <AN...@paypal.com.INVALID> on 2015/07/06 09:05:36 UTC

How does Spark streaming move data around ?

I know that Spark is using data parallelism over, say, HDFS - optimally running computations on local data (aka data locality).
I was wondering how Spark streaming moves data (messages) around? since the data is streamed in as DStreams and is not on a distributed FS like HDFS.

Thanks!