Posted to issues@spark.apache.org by "Anish Shrigondekar (Jira)" <ji...@apache.org> on 2024/01/23 18:36:00 UTC

[jira] [Updated] (SPARK-45939) SPIP: Structured Streaming - Arbitrary State API v2

     [ https://issues.apache.org/jira/browse/SPARK-45939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Anish Shrigondekar updated SPARK-45939:
---------------------------------------
    Epic Link: SPARK-46815

> SPIP: Structured Streaming - Arbitrary State API v2
> ---------------------------------------------------
>
>                 Key: SPARK-45939
>                 URL: https://issues.apache.org/jira/browse/SPARK-45939
>             Project: Spark
>          Issue Type: New Feature
>          Components: Structured Streaming
>    Affects Versions: 4.0.0
>            Reporter: Anish Shrigondekar
>            Priority: Major
>
> SPIP: Structured Streaming - Arbitrary State API v2
>  
> We are planning to introduce a new operator for [Spark Structured Streaming|https://spark.apache.org/streaming/] to support [arbitrary stateful operations|https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html#arbitrary-stateful-operations]. The existing operators ([mapGroupsWithState/flatMapGroupsWithState|https://spark.apache.org/docs/3.1.3/api/java/org/apache/spark/sql/streaming/GroupState.html]) have several limitations (as noted in Q3 below) that prevent their use in complex or advanced stateful stream processing use cases. We plan to remove these limitations by introducing a new API for state manipulation in streaming queries.
>  
> As part of this effort, we want to address:
>  * composite types
>  * multiple state variables (flexible data modeling)
>  * state TTL
>  * callback/timer improvements
>  * schema evolution
>  * initial state
>  * side outputs
>  * output modes
>  * co-process functions
>  * chaining operators after the new operator, etc.


