You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Teddy Choi (JIRA)" <ji...@apache.org> on 2017/08/18 18:09:00 UTC

[jira] [Updated] (ORC-210) Add encoding for Double, Float.

     [ https://issues.apache.org/jira/browse/ORC-210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Teddy Choi updated ORC-210:
---------------------------
    Attachment: ORC-210.1.patch

This patch file includes a new modified FPC implementation in ASLv2, some unit tests, benchmarks, realistic test data sets including TPC-DS and IoT meter. It shows 40-80% compression ratio.

I will do following things further;
- Implement a C++ reader
- Sync ORC version information with ORC-209, Decimal64
- Separate plain double encoder/decoder as DoubleReaderV1/DoubleWriterV1
- Make a pull request in GitHub
- Add a decoder benchmark

> Add encoding for Double, Float.
> -------------------------------
>
>                 Key: ORC-210
>                 URL: https://issues.apache.org/jira/browse/ORC-210
>             Project: ORC
>          Issue Type: Improvement
>          Components: encoding, Java
>    Affects Versions: 1.5.0
>            Reporter: Dapeng Sun
>            Assignee: Teddy Choi
>         Attachments: ORC-210.1.patch, patch.txt
>
>
> Currently, Double and Float are using PLAIN encoding, it is better to support encoding such as Dictionary or BitPacking to reduce the storage cost.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)