Posted to issues@spark.apache.org by "Jungtaek Lim (Jira)" <ji...@apache.org> on 2022/10/23 23:37:00 UTC
[jira] [Created] (SPARK-40892) Loosen the requirement of window_time rule - allow multiple window_time calls
Jungtaek Lim created SPARK-40892:
------------------------------------
Summary: Loosen the requirement of window_time rule - allow multiple window_time calls
Key: SPARK-40892
URL: https://issues.apache.org/jira/browse/SPARK-40892
Project: Spark
Issue Type: Improvement
Components: Structured Streaming
Affects Versions: 3.4.0
Reporter: Jungtaek Lim
SPARK-40821 introduces a new SQL function "window_time" that extracts the representative time from a window (and carries over the event time metadata where feasible).
SPARK-40821 followed the existing rule for time window / session window, which allows only a single function call in the same projection (strictly speaking, calls with the same parameters are counted as a single call).
For the existing functions, the restriction makes sense, since allowing multiple calls would produce a cartesian product of rows (although Spark can handle it). But window_time produces only one value per window, so the restriction does not make sense for it.
It would be better to unlock the functionality. Note that this means the resulting column of "window_time()" will no longer be named "window_time".
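The row-count argument above can be illustrated with a minimal plain-Python sketch. It does not use Spark and is not how Spark implements any of this. The helpers sliding_windows and window_time are hypothetical models written for this example, and the model of window_time assumes the representative time is the window end minus one microsecond (the upper bound of a window being exclusive).

```python
from datetime import datetime, timedelta
from itertools import product

EPOCH = datetime(1970, 1, 1)


def sliding_windows(event_time, duration, slide):
    """Return every [start, end) window that contains event_time, oldest first."""
    # Most recent window start at or before the event, aligned to the slide.
    start = event_time - (event_time - EPOCH) % slide
    windows = []
    while start + duration > event_time:
        windows.append((start, start + duration))
        start -= slide
    return sorted(windows)


def window_time(window):
    """Hypothetical model of window_time: one timestamp per window.

    Assumption: the representative time is the window end minus
    1 microsecond, because the upper bound is exclusive.
    """
    _start, end = window
    return end - timedelta(microseconds=1)


event = datetime(2022, 10, 23, 23, 37)

# Two differently parameterized sliding windows over the same event.
windows_a = sliding_windows(event, timedelta(minutes=10), timedelta(minutes=5))
windows_b = sliding_windows(event, timedelta(minutes=15), timedelta(minutes=5))

# Combining both in one projection multiplies the rows (cartesian product).
combined = list(product(windows_a, windows_b))
print(len(windows_a), len(windows_b), len(combined))

# window_time maps each window to exactly one value, so applying it to both
# window columns of a row adds columns but never adds rows.
with_times = [(window_time(a), window_time(b)) for a, b in combined]
print(len(with_times) == len(combined))
```

In this model, each extra sliding-window call multiplies the number of rows per event, while each extra window_time call only adds one more column to rows that already exist.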
--
This message was sent by Atlassian Jira
(v8.20.10#820010)