You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@carbondata.apache.org by "Ajantha Bhat (Jira)" <ji...@apache.org> on 2020/08/06 09:33:00 UTC

[jira] [Resolved] (CARBONDATA-3879) Filtering Segmets Optimazation

     [ https://issues.apache.org/jira/browse/CARBONDATA-3879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ajantha Bhat resolved CARBONDATA-3879.
--------------------------------------
    Fix Version/s:     (was: 2.0.2)
                   2.1.0
       Resolution: Fixed

> Filtering Segmets Optimazation
> ------------------------------
>
>                 Key: CARBONDATA-3879
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3879
>             Project: CarbonData
>          Issue Type: Improvement
>          Components: data-query
>    Affects Versions: 2.0.0
>            Reporter: Xingjun Hao
>            Priority: Major
>             Fix For: 2.1.0
>
>          Time Spent: 4.5h
>  Remaining Estimate: 0h
>
> During filter segments flow, there are a lot of LIST.CONTAINS, which has heavy time overhead when there are tens of thousands segments.
> For example, if there are 50000 segments. it will trigger LIST.CONTAINS  for each segment, the LIST also has about 50000 elements. so the time complexity will be O(50000 * 50000 )



--
This message was sent by Atlassian Jira
(v8.3.4#803005)