You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@carbondata.apache.org by "Kunal Kapoor (Jira)" <ji...@apache.org> on 2020/05/20 10:51:00 UTC

[jira] [Updated] (CARBONDATA-3617) loadDataUsingGlobalSort should based on SortColumns Instead Of Whole CarbonRow

     [ https://issues.apache.org/jira/browse/CARBONDATA-3617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kunal Kapoor updated CARBONDATA-3617:
-------------------------------------
    Fix Version/s:     (was: 2.0.0)
                       (was: 1.6.1)
                   2.1.0

> loadDataUsingGlobalSort should based on SortColumns Instead Of Whole CarbonRow
> ------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-3617
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3617
>             Project: CarbonData
>          Issue Type: Improvement
>          Components: data-load
>    Affects Versions: 1.6.1, 2.0.0
>            Reporter: Xingjun Hao
>            Priority: Minor
>             Fix For: 2.1.0
>
>          Time Spent: 7h 50m
>  Remaining Estimate: 0h
>
> During loading Data usesing globalsort, the sortby processing is based the whole carbon row, the overhead of gc is huge when there are many columns. Theoretically, the sortby processing can works well just based on the sort columns, which will brings less time overhead and gc overhead.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)