You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Jungtaek Lim (Jira)" <ji...@apache.org> on 2021/08/11 12:53:00 UTC

[jira] [Created] (SPARK-36480) SessionWindowStateStoreSaveExec should not filter input rows against watermark

Jungtaek Lim created SPARK-36480:
------------------------------------

             Summary: SessionWindowStateStoreSaveExec should not filter input rows against watermark
                 Key: SPARK-36480
                 URL: https://issues.apache.org/jira/browse/SPARK-36480
             Project: Spark
          Issue Type: Bug
          Components: Structured Streaming
    Affects Versions: 3.2.0
            Reporter: Jungtaek Lim


SessionWindowStateStoreSaveExec receives all sessions including existing sessions into input rows and stores as they are. That said, we should not filter out input rows before storing into state store, but we do. 

Fortunately it hasn't showed any actual problem due to the nature how we deal with watermark against micro-batch and it seems hard to come up with the broken case, but it should be better to fix it before someone succeeds to touch the possible edge case.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org