Posted to issues@orc.apache.org by "Gang Wu (Jira)" <ji...@apache.org> on 2022/03/17 07:27:00 UTC
[jira] [Resolved] (ORC-1055) [C++] Timestamp values read in Hive are different when using ORC file created using CSV to ORC converter tools
[ https://issues.apache.org/jira/browse/ORC-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Wu resolved ORC-1055.
--------------------------
Assignee: ZhangXin
Resolution: Fixed
This is resolved via https://github.com/apache/orc/pull/975
> [C++] Timestamp values read in Hive are different when using ORC file created using CSV to ORC converter tools
> --------------------------------------------------------------------------------------------------------------
>
> Key: ORC-1055
> URL: https://issues.apache.org/jira/browse/ORC-1055
> Project: ORC
> Issue Type: Bug
> Components: C++
> Reporter: Yiqun Zhang
> Assignee: ZhangXin
> Priority: Major
> Attachments: converted_by_cpp.orc, timestamp.csv
>
>
> I have a CSV file with a timestamp column whose value is 0001-01-01 00:00:00.0. I convert the CSV file to an ORC file using the CSV to ORC converter and place the ORC file in a Hive table backed by ORC files. Querying the data using Hive beeline and Spark SQL gives different results.
> If converted using the C++ tool, the value read by Hive beeline and Spark SQL queries is 0001-01-03 00:00:00.
> Reported by [~vraval48]
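The two-day shift reported above matches the offset between the proleptic Gregorian calendar and the Julian calendar in year 1. This message does not state the root cause, so the following is only a sketch under the assumption that the writer encoded the date as proleptic Gregorian and the reader decoded the same day count as Julian. The function names are hypothetical and are not part of ORC, Hive, or Spark; the arithmetic is the standard integer Julian Day Number conversion.

```python
def gregorian_to_jdn(year, month, day):
    """Julian Day Number of a proleptic Gregorian calendar date."""
    a = (14 - month) // 12          # 1 for January/February, else 0
    y = year + 4800 - a             # shift so the year starts in March
    m = month + 12 * a - 3          # March = 0, ..., February = 11
    return (day + (153 * m + 2) // 5 + 365 * y
            + y // 4 - y // 100 + y // 400 - 32045)


def jdn_to_julian(jdn):
    """Julian calendar date (year, month, day) of a Julian Day Number."""
    f = jdn + 1401
    e = 4 * f + 3
    g = (e % 1461) // 4
    h = 5 * g + 2
    day = (h % 153) // 5 + 1
    month = (h // 153 + 2) % 12 + 1
    year = e // 1461 - 4716 + (14 - month) // 12
    return year, month, day


# Value from the CSV file, interpreted as proleptic Gregorian (assumption).
jdn = gregorian_to_jdn(1, 1, 1)
# The same day, rendered in the Julian calendar (assumption about the reader).
print(jdn_to_julian(jdn))  # (1, 1, 3)
```

Under these assumptions, 0001-01-01 written as proleptic Gregorian reads back as 0001-01-03, which is the value quoted in the report.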
--
This message was sent by Atlassian Jira
(v8.20.1#820001)