You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@carbondata.apache.org by "SHREELEKHYA GAMPA (Jira)" <ji...@apache.org> on 2022/04/05 05:24:00 UTC

[jira] [Created] (CARBONDATA-4330) Incremental‌ ‌Dataload‌ ‌of Average aggregate in ‌MV‌‌

SHREELEKHYA GAMPA created CARBONDATA-4330:
---------------------------------------------

             Summary:  Incremental‌ ‌Dataload‌ ‌of Average aggregate in ‌MV‌‌
                 Key: CARBONDATA-4330
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-4330
             Project: CarbonData
          Issue Type: Improvement
            Reporter: SHREELEKHYA GAMPA


Currently, whenever MV is created with average aggregate, a full refresh is done meaning it reloads the whole MV for any newly added segments. This will slow down the loading. With incremental data load, only the segments that are newly added can be loaded to the MV.
If avg is present, rewrite the query with the sum and count of the columns to create MV and use them to derive avg.
Refer: https://docs.google.com/document/d/1kPEMCX50FLZcmyzm6kcIQtUH9KXWDIqh-Hco7NkTp80/edit




--
This message was sent by Atlassian Jira
(v8.20.1#820001)