You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by M Singh <ma...@yahoo.com.INVALID> on 2017/12/30 22:40:08 UTC

Apache Spark - Using withWatermark for DataSets

Hi:
I am working with DataSets so that I can use mapGroupsWithState for business logic and then use dropDuplicates over a set of fields.  I would like to use the withWatermark so that I can restrict the how much state is stored.  
From the API it looks like withWatermark takes a string - timestamp column name as argument.  Is it possible to use it with DataSets ?  If not, is there any alternative like withWatermark available for working with DataSets ?
Thanks
Mans