You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/04/27 15:50:04 UTC
[jira] [Commented] (FLINK-6388) Add support for DISTINCT into Code
Generated Aggregations
[ https://issues.apache.org/jira/browse/FLINK-6388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15986851#comment-15986851 ]
ASF GitHub Bot commented on FLINK-6388:
---------------------------------------
Github user fhueske commented on the issue:
https://github.com/apache/flink/pull/3783
Thanks for this PR @huawei-flink!
I think I made a mistake when I suggested to use the code-gen'd functions with registered `MapState` to compute distinct window aggregations. Originally, I thought it would be possible to register state (i.e., the `MapState` for the distinct values) in an `AggregateFunction` (which is used for the grouped window aggregates). However, that's unfortunately not possible as I learned today. All state of an `AggregateFunction` must be contained in the accumulator.
What does this mean? We cannot use the current approach of registering `MapState` in the code-gen'd function for group windowed aggregates. So we would need another approach for that.
However, we can still use your code for distinct over windows (`ProcessFunction` can obviously register state) once the API supports to define DISTINCT aggregates.
I'll try to have a closer look at this PR soon.
Best, Fabian
> Add support for DISTINCT into Code Generated Aggregations
> ---------------------------------------------------------
>
> Key: FLINK-6388
> URL: https://issues.apache.org/jira/browse/FLINK-6388
> Project: Flink
> Issue Type: Sub-task
> Components: DataStream API
> Affects Versions: 1.3.0
> Reporter: Stefano Bortoli
> Assignee: Stefano Bortoli
> Fix For: 1.3.0
>
>
> We should support DISTINCT in Code Generated aggrgation functions.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)