You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2022/04/17 12:34:00 UTC

[jira] [Assigned] (IMPALA-10956) datasketches UDFS: memory leak and merge overhead

     [ https://issues.apache.org/jira/browse/IMPALA-10956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Quanlong Huang reassigned IMPALA-10956:
---------------------------------------

    Assignee: Alexander Saydakov

> datasketches UDFS: memory leak and merge overhead
> -------------------------------------------------
>
>                 Key: IMPALA-10956
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10956
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>    Affects Versions: Impala 4.0.0
>            Reporter: Alexander Saydakov
>            Assignee: Alexander Saydakov
>            Priority: Minor
>
> I believe that there are memory leaks in aggregate UDFs that are using Apache Datasketches. Sketch or union objects are placed in a buffer to maintain the state of the aggregation, and later deallocated without calling destructors for those objects.
> Also there is unnecessary overhead during the merge phase of some of those UDFs, when an instance of a union is created for every merge operation and finalized by calling a quite expensive get_result() every time as well. This second issue is about performance, not a bug.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org