You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@madlib.apache.org by "Orhan Kislal (JIRA)" <ji...@apache.org> on 2019/06/14 18:34:00 UTC

[jira] [Created] (MADLIB-1361) Pivot: Fix array_agg + distinct scaling issue on gpdb

Orhan Kislal created MADLIB-1361:
------------------------------------

             Summary: Pivot: Fix array_agg + distinct scaling issue on gpdb 
                 Key: MADLIB-1361
                 URL: https://issues.apache.org/jira/browse/MADLIB-1361
             Project: Apache MADlib
          Issue Type: Bug
          Components: Module: Utilities
            Reporter: Orhan Kislal


With large datasets, pivot fails because of the array_agg(distinct) query. This is because array_agg collects the values first and filters the distinct values later. This causes the array_agg to go out of memory.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)