Posted to dev@carbondata.apache.org by QiangCai <qi...@qq.com> on 2017/04/05 10:31:32 UTC

[DISCUSSION]implement delta encoding for numeric type column in SORT_COLUMNS

Hi all,

    We plan to implement delta encoding for numeric type columns in
SORT_COLUMNS.

    1. Use delta encoding to encode the numeric type data.

    2. Write present metadata to the page header to record null values.

    3. Improve the compression of no-dictionary string columns:
        use RLE to compress the array of lengths in LV encoding.
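
    The three steps can be sketched roughly as follows. This is only an
illustration in Python, not CarbonData code: every function name is
hypothetical, the present metadata is modelled as a plain list of booleans
rather than a real page-header layout, and the RLE output is a list of
(length, run count) pairs rather than any on-disk format.

```python
from itertools import groupby


def split_nulls(column):
    """Step 2 (assumed shape): present flags plus the non-null values."""
    present = [v is not None for v in column]
    values = [v for v in column if v is not None]
    return present, values


def merge_nulls(present, values):
    """Put nulls back at the positions where the present flag is False."""
    it = iter(values)
    return [next(it) if p else None for p in present]


def delta_encode(values):
    """Step 1: keep the first value, then store successive differences."""
    if not values:
        return []
    return [values[0]] + [b - a for a, b in zip(values, values[1:])]


def delta_decode(encoded):
    """Rebuild the original values with a running sum over the deltas."""
    out, total = [], 0
    for d in encoded:
        total += d
        out.append(total)
    return out


def rle_encode(lengths):
    """Step 3: collapse runs of equal lengths into (length, run_count)."""
    return [(k, len(list(g))) for k, g in groupby(lengths)]


def rle_decode(runs):
    """Expand (length, run_count) pairs back into the length array."""
    return [k for k, n in runs for _ in range(n)]


# A sorted numeric column with one null.
column = [1000, 1003, None, 1003, 1010]
present, values = split_nulls(column)
deltas = delta_encode(values)

# Lengths of the values in an LV (length-value) encoded string column.
strings = ["ab", "cd", "ef", "ghi"]
runs = rle_encode([len(s) for s in strings])
```

    Because the column is sorted, the deltas stay small and non-negative,
which is what makes them cheaper to store than the raw values. The RLE step
only pays off when many neighbouring strings have the same length.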

    Any thoughts, comments, or questions?

Best Regards
David QiangCai



--
View this message in context: http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/DISCUSSION-implement-delta-encoding-for-numeric-type-column-in-SORT-COLUMNS-tp10124.html
Sent from the Apache CarbonData Mailing List archive mailing list archive at Nabble.com.

Re: [DISCUSSION]implement delta encoding for numeric type column in SORT_COLUMNS

Posted by Liang Chen <ch...@gmail.com>.
Hi David

Thanks for starting the discussion on this new feature.
Can you explain the major benefits of using delta encoding for
numeric type columns?

Regards
Liang





Re: [DISCUSSION]implement delta encoding for numeric type column in SORT_COLUMNS

Posted by Jacky Li <ja...@qq.com>.
> On April 5, 2017, at 6:31 PM, QiangCai <qi...@qq.com> wrote:
> 
> Hi all,
> 
>    Now we plan to implement delta encoding for the numeric type column in
> SORT_COLUMNS. 
> 
>    1. use delta encoding to encode the numeric type data
> 

I think the adaptive data type conversion still applies here, right?
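
To illustrate how the two could combine, here is a minimal Python sketch. It
assumes that "adaptive data type conversion" means storing a page in the
narrowest integer type that fits its value range; that reading, and the
function names, are assumptions for illustration and not CarbonData's API.

```python
def narrowest_width(values):
    """Smallest signed width in bytes (1, 2, 4 or 8) that holds every value."""
    if not values:
        return 1
    lo, hi = min(values), max(values)
    for width in (1, 2, 4, 8):
        bound = 1 << (8 * width - 1)
        if -bound <= lo and hi < bound:
            return width
    raise OverflowError("value does not fit in 64 bits")


def pack(values):
    """Serialize the values little-endian at the narrowest width that fits."""
    width = narrowest_width(values)
    data = b"".join(v.to_bytes(width, "little", signed=True) for v in values)
    return width, data


# Sorted values: the raw numbers need 4 bytes each, the deltas only 1.
raw = [1491388292, 1491388295, 1491388301]
deltas = [b - a for a, b in zip(raw, raw[1:])]

raw_width = narrowest_width(raw)
delta_width, delta_bytes = pack(deltas)
```

In this sketch the base value (the first element) would be stored once at
full width, and only the deltas go through the narrowing step.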

>    2. write present metadata to the page header, to record the null value.
> 
>    3. improve the compression on no-dictionary string column.
>        use RLE to compress the array of the lengths in LV encoding.
> 

I think a local dictionary is more suitable for this case.
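
As a rough sketch of the idea, assuming "local dictionary" means a
dictionary built per page (or per blocklet) rather than per table, the
encoding could look like the Python below. The function names are
hypothetical and the output is not a real file layout.

```python
def local_dictionary_encode(strings):
    """Map each distinct string to a small integer id in first-seen order."""
    dictionary, ids = {}, []
    for s in strings:
        # setdefault assigns the next free id only if s is not yet present.
        ids.append(dictionary.setdefault(s, len(dictionary)))
    return list(dictionary), ids


def local_dictionary_decode(words, ids):
    """Look each id up in the page's word list to recover the strings."""
    return [words[i] for i in ids]


page = ["beijing", "shenzhen", "beijing", "beijing"]
words, ids = local_dictionary_encode(page)
```

With a dictionary, each distinct string is stored once and the page holds
only integer ids, so the length array that RLE would compress is needed only
for the dictionary entries. This helps when a page has few distinct values;
for mostly unique strings the dictionary adds overhead instead.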

>    Any thoughts, comments and questions ?
> 
> Best Regards
> David QiangCai
> 
> 