Posted to issues-all@impala.apache.org by "Alexander Saydakov (Jira)" <ji...@apache.org> on 2021/10/07 21:52:00 UTC

[jira] [Updated] (IMPALA-10956) datasketches UDFS: memory leak and merge overhead

     [ https://issues.apache.org/jira/browse/IMPALA-10956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexander Saydakov updated IMPALA-10956:
----------------------------------------
    Affects Version/s:     (was: Impala 3.4.0)
                       Impala 4.0.0

> datasketches UDFS: memory leak and merge overhead
> -------------------------------------------------
>
>                 Key: IMPALA-10956
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10956
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>    Affects Versions: Impala 4.0.0
>            Reporter: Alexander Saydakov
>            Priority: Minor
>
> I believe there are memory leaks in the aggregate UDFs that use Apache DataSketches. Sketch or union objects are constructed in a buffer that holds the state of the aggregation, and that buffer is later deallocated without calling the destructors of those objects, so any memory the objects own internally is never released.
> There is also unnecessary overhead in the merge phase of some of those UDFs: a new union instance is created for every merge operation and finalized each time by calling get_result(), which is quite expensive. This second issue concerns performance; it is not a bug.
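
The two problems described above can be illustrated with a minimal C++ sketch. It is not Impala or datasketches code: FakeUnion is a hypothetical stand-in that only counts live objects and get_result() calls, and InitState, FreeStateLeaky, FreeStateFixed, MergeEachTime and MergeOnce are names invented for this illustration. The assumption is that the aggregation state is built with placement new in a raw buffer, as the description suggests.

```cpp
#include <cstdlib>
#include <new>
#include <vector>

// Hypothetical stand-in for a sketch/union object (not the real datasketches class).
struct FakeUnion {
  static int live;          // objects constructed but not yet destroyed
  static int result_calls;  // number of get_result() invocations
  std::vector<int> items;
  FakeUnion() { ++live; }
  ~FakeUnion() { --live; }
  void update(const std::vector<int>& other) {
    items.insert(items.end(), other.begin(), other.end());
  }
  std::vector<int> get_result() const {
    ++result_calls;  // stands in for an expensive finalization step
    return items;
  }
};
int FakeUnion::live = 0;
int FakeUnion::result_calls = 0;

// Build the aggregation state inside a raw buffer with placement new.
void* InitState() {
  void* buf = std::malloc(sizeof(FakeUnion));
  if (buf == nullptr) throw std::bad_alloc();
  return new (buf) FakeUnion();
}

// Leaky pattern: the buffer is freed, but the destructor never runs,
// so anything the object owns on the heap would be lost.
void FreeStateLeaky(void* buf) { std::free(buf); }

// Fixed pattern: run the destructor explicitly, then free the buffer.
void FreeStateFixed(void* buf) {
  static_cast<FakeUnion*>(buf)->~FakeUnion();
  std::free(buf);
}

// Costly merge pattern: a new union and a get_result() call per merge.
std::vector<int> MergeEachTime(const std::vector<std::vector<int>>& inputs) {
  std::vector<int> state;
  for (const auto& in : inputs) {
    FakeUnion u;
    u.update(state);
    u.update(in);
    state = u.get_result();
  }
  return state;
}

// Cheaper pattern: keep one union as the state and finalize once.
std::vector<int> MergeOnce(const std::vector<std::vector<int>>& inputs) {
  FakeUnion u;
  for (const auto& in : inputs) u.update(in);
  return u.get_result();
}
```

In this sketch, FakeUnion::live stays above zero after FreeStateLeaky because the destructor is skipped, while FreeStateFixed brings it back down. For the same inputs, MergeEachTime calls get_result() once per input and MergeOnce calls it a single time, with both returning the same merged result.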


