Posted to dev@orc.apache.org by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2017/10/13 00:09:00 UTC

[jira] [Created] (ORC-250) Create sha256 mask

Owen O'Malley created ORC-250:
---------------------------------

             Summary: Create sha256 mask
                 Key: ORC-250
                 URL: https://issues.apache.org/jira/browse/ORC-250
             Project: ORC
          Issue Type: Sub-task
            Reporter: Owen O'Malley


We should also create a DataMask that does sha256 of the data:

* strings should be the sha256 of the utf-8 representation of the string, with the digest written as hex digits
* binary should be the sha256 of the bytes, with the digest kept as raw binary
* integer types should be the sha256 of the little endian representation of the number, with the digest read as little endian and cut down to the right size (1, 2, 4, or 8 bytes)
* floating point types should be sha256 of the binary representation as either 4 (float) or 8 (double) bytes
* timestamps and dates should convert like integers
* decimal should convert like 128 bit numbers with the result cut to the matching number of bytes
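
A rough sketch of the string, binary, and integer rules above, to make the byte handling concrete. The class and method names are hypothetical and are not part of the ORC DataMask API. It also assumes that the integer hash input is the value's little endian encoding at the type's own width, and that "cut down" means keeping the first bytes of the digest; the description above does not settle either point.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;

// Hypothetical helper class, for illustration only.
public class Sha256MaskSketch {

  // SHA-256 is required on every Java platform, so the checked
  // exception is not expected to occur in practice.
  static byte[] sha256(byte[] input) {
    try {
      return MessageDigest.getInstance("SHA-256").digest(input);
    } catch (NoSuchAlgorithmException e) {
      throw new IllegalStateException(e);
    }
  }

  // strings: hash the UTF-8 bytes, emit the digest as 64 lowercase hex digits
  static String maskString(String value) {
    byte[] hash = sha256(value.getBytes(StandardCharsets.UTF_8));
    StringBuilder hex = new StringBuilder(hash.length * 2);
    for (byte b : hash) {
      hex.append(String.format("%02x", b & 0xff));
    }
    return hex.toString();
  }

  // binary: the raw 32-byte digest
  static byte[] maskBinary(byte[] value) {
    return sha256(value);
  }

  // integers: hash the little endian encoding (assumed: 'width' bytes),
  // then read the first 'width' digest bytes back as little endian.
  // width must be 1, 2, 4, or 8; the caller casts the result to the
  // column's type (for width < 8 the result is returned zero-extended).
  static long maskInteger(long value, int width) {
    byte[] le = ByteBuffer.allocate(8)
        .order(ByteOrder.LITTLE_ENDIAN)
        .putLong(value)
        .array();
    byte[] hash = sha256(Arrays.copyOf(le, width));
    long result = 0;
    for (int i = width - 1; i >= 0; i--) {
      result = (result << 8) | (hash[i] & 0xff);
    }
    return result;
  }

  public static void main(String[] args) {
    System.out.println(maskString("abc"));
    System.out.println(maskBinary(new byte[] {1, 2, 3}).length);
    System.out.println((int) maskInteger(42, 4));
  }
}
```

Floating point, timestamp, date, and decimal columns would presumably follow the same pattern as maskInteger with a different encoding step, but the byte order for those inputs is not stated above, so they are left out of the sketch.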

It isn't clear what we should do for the very small data types:
* boolean
* byte
* short

I'd lean toward either making them null or passing them through unchanged.


