You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@carbondata.apache.org by "Akash R Nilugal (JIRA)" <ji...@apache.org> on 2019/05/06 08:39:00 UTC
[jira] [Commented] (CARBONDATA-3296) Support incremental dataload
to datamap and other mv datamap enhancements
[ https://issues.apache.org/jira/browse/CARBONDATA-3296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833601#comment-16833601 ]
Akash R Nilugal commented on CARBONDATA-3296:
---------------------------------------------
Design document is updated at https://docs.google.com/document/d/1AACOYmBpwwNdHjJLOub0utSc6JCBMZn8VL5CvZ9hygA/edit
> Support incremental dataload to datamap and other mv datamap enhancements
> -------------------------------------------------------------------------
>
> Key: CARBONDATA-3296
> URL: https://issues.apache.org/jira/browse/CARBONDATA-3296
> Project: CarbonData
> Issue Type: Bug
> Reporter: Akash R Nilugal
> Priority: Major
>
> Currently in carbondata we have datamaps like preaggregate, lucene, bloom,
> mv and we have
> lazy and non-lazy methods to load data to datamaps. But lazy load is not
> allowed for datamaps
> like preagg, lucene, bloom.but, it is allowed for mv datamap. In lazy load
> of mv datamap, for
> every rebuild(load to datamap) we load the complete data of main table and
> overwrite the existing
> segment in datamap based on datamap query.
> This is very costly in terms of performance and we also need to support the
> lazy and non-lazy load
> for all the datamaps. This can help in reduce the actual dataload time to
> main table and whenever
> user wants, he can do the lazy load for the datamaps present for that table.
> Basically we need not overwrite the existing data every time we load to
> datamap, we need to increment
> the data in new segments similar to main table. This will help to get
> better performance.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)