Posted to issues@orc.apache.org by "Owen O'Malley (Jira)" <ji...@apache.org> on 2019/12/04 00:18:00 UTC

[jira] [Commented] (ORC-27) C++ reader does not read dates correctly prior to 1583

    [ https://issues.apache.org/jira/browse/ORC-27?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16987379#comment-16987379 ] 

Owen O'Malley commented on ORC-27:
----------------------------------

This became a much bigger issue because of two factors:
 * One of the third-party encryption solutions uses a wide range of dates for its format-preserving encryption.
 * Hive 3 has moved to using the proleptic Gregorian calendar.

The combination of the two means that files written by older Hive versions are misread by the newer Hive whenever they contain these old dates, which show up because of the third-party encryption library.
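
To illustrate where the two-day offset in the reproducer comes from, here is a minimal sketch. It assumes ORC dates are stored as days since 1970-01-01 and that the older writer interpreted dates before the 1582 cutover in the Julian calendar (the hybrid calendar behaviour of java.util.GregorianCalendar). The function names are hypothetical and are not part of ORC or Hive; the conversions are the standard Julian Day Number formulas.

```python
# Hypothetical helpers, not ORC or Hive code: they only show the arithmetic.

EPOCH_JDN = 2440588  # Julian Day Number of 1970-01-01 (Gregorian)


def jdn_from_julian(year, month, day):
    """Julian Day Number of a date given in the Julian calendar."""
    a = (14 - month) // 12
    y = year + 4800 - a
    m = month + 12 * a - 3
    return day + (153 * m + 2) // 5 + 365 * y + y // 4 - 32083


def jdn_from_gregorian(year, month, day):
    """Julian Day Number of a date given in the proleptic Gregorian calendar."""
    a = (14 - month) // 12
    y = year + 4800 - a
    m = month + 12 * a - 3
    return (day + (153 * m + 2) // 5 + 365 * y
            + y // 4 - y // 100 + y // 400 - 32045)


def gregorian_from_jdn(jdn):
    """Proleptic Gregorian (year, month, day) for a Julian Day Number.

    Years use astronomical numbering, so year 0 is 1 BC.
    """
    f = jdn + 1401 + (((4 * jdn + 274277) // 146097) * 3) // 4 - 38
    e = 4 * f + 3
    h = 5 * ((e % 1461) // 4) + 2
    day = (h % 153) // 5 + 1
    month = (h // 153 + 2) % 12 + 1
    year = e // 1461 - 4716 + (14 - month) // 12
    return year, month, day


# Day count an assumed hybrid-calendar writer would store for 0001-01-01.
written = jdn_from_julian(1, 1, 1) - EPOCH_JDN
# Day count a proleptic Gregorian writer would store for the same date.
proleptic = jdn_from_gregorian(1, 1, 1) - EPOCH_JDN
# What a proleptic Gregorian reader reports for the hybrid writer's value.
read_back = gregorian_from_jdn(written + EPOCH_JDN)

print(written)              # -719164
print(proleptic)            # -719162
print(proleptic - written)  # 2
print(read_back)            # (0, 12, 30)
```

Under these assumptions the stored value is two days smaller than the proleptic Gregorian day count, so a reader that assumes the proleptic calendar reports 0-12-30 for a value written as 0001-01-01.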

> C++ reader does not read dates correctly prior to 1583
> ------------------------------------------------------
>
>                 Key: ORC-27
>                 URL: https://issues.apache.org/jira/browse/ORC-27
>             Project: ORC
>          Issue Type: Bug
>            Reporter: Aliaksei Sandryhaila
>            Assignee: Owen O'Malley
>            Priority: Minor
>             Fix For: 1.5.9, 1.6.3, 1.7.0
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Reproducer:
> 1) create a Hive table with a date column
> 2) insert date 01-01-01
> 3) get the corresponding orc file from HDFS
> 4) read with the C++ reader
> The reader produces 0-12-30 (off by 2 days)


