Posted to user@spark.apache.org by Chen Song <ch...@gmail.com> on 2014/06/25 21:23:03 UTC

semi join spark streaming

Is there an easy way to do a semi join in Spark Streaming?

Here is my problem, briefly: I have a DStream that generates a set of
values, and I would like to check whether the values in other DStreams
exist in that set.
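
To make the semantics concrete, here is a rough sketch in plain Python of
what I mean by a semi join within a single batch interval. It deliberately
uses no Spark API, and the names (left_semi_join, key_batch, event_batch)
and the sample data are made up for illustration only.

```python
from typing import Hashable, Iterable, List, Tuple, TypeVar

K = TypeVar("K", bound=Hashable)
V = TypeVar("V")


def left_semi_join(records: Iterable[Tuple[K, V]], keys: Iterable[K]) -> List[Tuple[K, V]]:
    """Keep the records whose key occurs in `keys`.

    Unlike an inner join, a record is emitted at most once even if its
    key appears several times in `keys`.
    """
    key_set = set(keys)  # duplicate keys collapse into one entry
    return [(k, v) for k, v in records if k in key_set]


# Hypothetical contents of one batch interval.
key_batch = ["a", "c", "c"]                              # values from the first DStream
event_batch = [("a", 1), ("b", 2), ("c", 3), ("a", 4)]   # records from another DStream

matched = left_semi_join(event_batch, key_batch)
print(matched)
```

In the streaming job, the same filtering would have to happen for every
batch, with the set of values coming from the first DStream.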

Is there an easy, standard way to model this problem? If not, can I write
a Spark Streaming job that loads the set of values from disk and caches it
on each worker?

-- 
Chen Song