You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Varun Raval (Jira)" <ji...@apache.org> on 2021/11/30 16:55:00 UTC
[jira] [Comment Edited] (ORC-1054) Unable to compare data (generated using CSV to ORC converter) on timestamp column
[ https://issues.apache.org/jira/browse/ORC-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17451216#comment-17451216 ]
Varun Raval edited comment on ORC-1054 at 11/30/21, 4:54 PM:
-------------------------------------------------------------
Yes, I get all the results on full query `select * from csvtest`. Sample orc file is file1.orc. It has single value 2021-11-10 01:02:15.553.
The destination table in Hive is an external table. Its description is show in hive_table_desc.jpg.
was (Author: vraval48):
Yes, I get all the results on fill query `select * from csvtest`. Sample orc file is file1.orc. It has single value 2021-11-10 01:02:15.553.
The destination table in Hive is an external table. Its description is show in hive_table_desc.jpg.
> Unable to compare data (generated using CSV to ORC converter) on timestamp column
> ---------------------------------------------------------------------------------
>
> Key: ORC-1054
> URL: https://issues.apache.org/jira/browse/ORC-1054
> Project: ORC
> Issue Type: Bug
> Components: C++, Java
> Reporter: Varun Raval
> Priority: Major
> Attachments: file1.orc, hive_table_desc.jpg, timestamp1.csv
>
>
> I have a CSV file with timestamp columns. Then I convert CSV file to ORC file using CSV to ORC converter and place the ORC file in a hive table backed by ORC files. I am not able to query the data using timestamp column on Apache Hive beeline. If timestamp is present in the select query, the corresponding rows are not retrieved.
> For example, table csvtest has single column (t) as timestamp datatype. It has a row '2021-11-10 01:02:15'. Query "select * from csvtest where t > '2021-11-10 00:00:00'" does not return any result. Query "select * from csvtest" returns the correct row.
> However, the same query "select * from csvtest where t > '2021-11-10 00:00:00'" works with Spark SQL and rows are retrieved correctly.
> Is this issue with how ORC file is created or is it some hive configuration issue?
> I have tested it on the master branch and results are same for both cpp and java csv to orc converters.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)