You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2017/07/14 06:17:00 UTC

[jira] [Assigned] (SPARK-21414) Buffer in SlidingWindowFunctionFrame could be big though window is small

     [ https://issues.apache.org/jira/browse/SPARK-21414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-21414:
------------------------------------

    Assignee: Apache Spark

> Buffer in SlidingWindowFunctionFrame could be big though window is small
> ------------------------------------------------------------------------
>
>                 Key: SPARK-21414
>                 URL: https://issues.apache.org/jira/browse/SPARK-21414
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.1.1
>            Reporter: jin xing
>            Assignee: Apache Spark
>
> In {{SlidingWindowFunctionFrame}}, it is now adding all rows to the buffer for which the input row value is equal to or less than the output row upper bound, then drop all rows from the buffer for which the input row value is smaller than the output row lower bound.
> This could result in the buffer is very big though the window is small. 
> For example: 
> {{select a, b, sum(a) over (partition by b order by a range between 1000000 following and 1000001 following) from table}}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org