Posted to user@spark.apache.org by soumick dasgupta <so...@gmail.com> on 2016/07/11 16:24:22 UTC

Marking files as read in Spark Streaming

Hi,

I am looking for a way in Spark Streaming to mark the files in HDFS that I
have already read. This is to make sure that I do not read the same file
twice by mistake, and also to ensure that I have read all the records in a
given file.

Thank You,

Soumick