Posted to issues@carbondata.apache.org by "SHREELEKHYA GAMPA (Jira)" <ji...@apache.org> on 2022/04/05 05:24:00 UTC
[jira] [Created] (CARBONDATA-4330) Incremental Dataload of Average aggregate in MV
SHREELEKHYA GAMPA created CARBONDATA-4330:
---------------------------------------------
Summary: Incremental Dataload of Average aggregate in MV
Key: CARBONDATA-4330
URL: https://issues.apache.org/jira/browse/CARBONDATA-4330
Project: CarbonData
Issue Type: Improvement
Reporter: SHREELEKHYA GAMPA
Currently, when an MV is created with an average (avg) aggregate, every load triggers a full refresh: the whole MV is reloaded whenever new segments are added. This slows down loading. With incremental data load, only the newly added segments need to be loaded into the MV.
If avg is present, rewrite the query to use the sum and count of the columns when creating the MV, and derive avg from them. Unlike avg, the sum and count from separate segments can simply be added together.
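
The idea can be illustrated with a minimal sketch in plain Python. This is not CarbonData code: the names `load_segment` and `query_avg`, and the in-memory dictionary standing in for the MV, are hypothetical and exist only to show why storing sum and count makes incremental loading possible.

```python
from collections import defaultdict

def load_segment(mv, rows):
    """Fold one newly added segment into the MV state (hypothetical helper).

    mv maps a group key to [running_sum, running_count].
    rows is an iterable of (group_key, value) pairs from the new segment only.
    """
    for key, value in rows:
        if value is None:
            # SQL avg ignores NULL values, so they touch neither sum nor count.
            continue
        entry = mv[key]
        entry[0] += value
        entry[1] += 1

def query_avg(mv):
    """Derive avg per group from the stored sum and count."""
    return {key: s / c for key, (s, c) in mv.items() if c}

# The MV stores sum and count instead of avg.
mv = defaultdict(lambda: [0, 0])

# First load: one segment.
load_segment(mv, [("a", 10), ("a", 20), ("b", 5)])

# Second load: only the new segment is read; earlier data is not reloaded.
load_segment(mv, [("a", 30), ("b", 15)])

result = query_avg(mv)
print(result)
```

Storing avg directly would not allow this, because two averages cannot be combined without knowing how many rows each one covers.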
Refer: https://docs.google.com/document/d/1kPEMCX50FLZcmyzm6kcIQtUH9KXWDIqh-Hco7NkTp80/edit