You are viewing a plain text version of this content. The canonical link for it is here.

Posted to notifications@asterixdb.apache.org by "Gift Sinthong (JIRA)" <ji...@apache.org> on 2018/11/14 23:22:00 UTC

[jira] [Created] (ASTERIXDB-2481) Out of Memory error doing aggregation

Gift Sinthong created ASTERIXDB-2481:
----------------------------------------

             Summary: Out of Memory error doing aggregation
                 Key: ASTERIXDB-2481
                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-2481
             Project: Apache AsterixDB
          Issue Type: Bug
          Components: COMP - Compiler, RT - Runtime, SQL - Translator SQL++
    Affects Versions: 0.9.5
         Environment: Linux
            Reporter: Gift Sinthong
         Attachments: Screen Shot 2018-11-14 at 3.12.31 PM.png

This is the schema for this query:

CREATE TYPE Test AS open{
 unique2: int64
};

CREATE DATASET wisconsin_1gb(Test)
 PRIMARY KEY unique2;



This is the query:
SELECT min( t.oddOnePercent) as min, max(t.oddOnePercent) as max, count(distinct t.oddOnePercent) as cnt
 FROM wisconsin_5gb t ;

 

The plan for this query:
distribute result [$$46]
-- DISTRIBUTE_RESULT |UNPARTITIONED|
 exchange
 -- ONE_TO_ONE_EXCHANGE |UNPARTITIONED|
 project ([$$46])
 -- STREAM_PROJECT |UNPARTITIONED|
 assign [$$46] <- [\{"min": $$48, "max": $$49, "cnt": $$50}]
 -- ASSIGN |UNPARTITIONED|
 project ([$$48, $$49, $$50])
 -- STREAM_PROJECT |UNPARTITIONED|
 subplan {
 aggregate [$$50] <- [agg-sql-sum($$53)]
 -- AGGREGATE |LOCAL|
 aggregate [$$53] <- [agg-sql-count($$43)]
 -- AGGREGATE |LOCAL|
 distinct ([$$43])
 -- MICRO_PRE_SORTED_DISTINCT_BY |LOCAL|
 order (ASC, $$43) 
 -- IN_MEMORY_STABLE_SORT [$$43(ASC)] |LOCAL|
 assign [$$43] <- [$$52.getField("oddOnePercent")]
 -- ASSIGN |UNPARTITIONED|
 assign [$$52] <- [$#4.getField(0)]
 -- ASSIGN |UNPARTITIONED|
 unnest $#4 <- scan-collection($$28)
 -- UNNEST |UNPARTITIONED|
 nested tuple source
 -- NESTED_TUPLE_SOURCE |UNPARTITIONED|
 }
 -- SUBPLAN |UNPARTITIONED|
 aggregate [$$28, $$48, $$49] <- [listify($$27), agg-sql-min($$33), agg-sql-max($$33)]
 -- AGGREGATE |UNPARTITIONED|
 exchange
 -- RANDOM_MERGE_EXCHANGE |PARTITIONED|
 project ([$$27, $$33])
 -- STREAM_PROJECT |PARTITIONED|
 assign [$$33, $$27] <- [$$t.getField("oddOnePercent"), \{"t": $$t}]
 -- ASSIGN |PARTITIONED|
 project ([$$t])
 -- STREAM_PROJECT |PARTITIONED|
 exchange
 -- ONE_TO_ONE_EXCHANGE |PARTITIONED|
 data-scan []<-[$$47, $$t] <- benchmark.wisconsin_5gb
 -- DATASOURCE_SCAN |PARTITIONED|
 exchange
 -- ONE_TO_ONE_EXCHANGE |PARTITIONED|
 empty-tuple-source
 -- EMPTY_TUPLE_SOURCE |PARTITIONED|



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)