Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/05/14 09:17:00 UTC

[jira] [Work logged] (HIVE-21200) Vectorization: date column throwing java.lang.UnsupportedOperationException for parquet

     [ https://issues.apache.org/jira/browse/HIVE-21200?focusedWorklogId=596565&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-596565 ]

ASF GitHub Bot logged work on HIVE-21200:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 14/May/21 09:16
            Start Date: 14/May/21 09:16
    Worklog Time Spent: 10m 
      Work Description: adrian-wang opened a new pull request #2276:
URL: https://github.com/apache/hive/pull/2276


   …OperationException for parquet
   
   
   ### What changes were proposed in this pull request?
   The date type should be treated as int instead of long.
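
For context, Parquet's DATE logical type annotates a 32-bit integer that holds the number of days since the Unix epoch, which is why a 32-bit read fits this column. Below is a minimal sketch of that encoding using only `java.time`. The class and method names are illustrative assumptions and are not taken from the patch.

```java
import java.time.LocalDate;

public class DateAsInt {
    // Encode a date as days since 1970-01-01; toIntExact throws
    // ArithmeticException if the value does not fit in 32 bits.
    static int toEpochDays(LocalDate date) {
        return Math.toIntExact(date.toEpochDay());
    }

    // Decode a 32-bit day count back into a calendar date.
    static LocalDate fromEpochDays(int days) {
        return LocalDate.ofEpochDay(days);
    }

    public static void main(String[] args) {
        int days = toEpochDays(LocalDate.of(2019, 1, 1));
        System.out.println(days + " -> " + fromEpochDays(days));
    }
}
```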
   
   
   ### Why are the changes needed?
   Without this change, an exception is thrown when reading Parquet files that contain a date column while vectorization is enabled.
   
   
   ### Does this PR introduce _any_ user-facing change?
   No.
   
   
   ### How was this patch tested?
   Added a unit test for the date type.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 596565)
    Remaining Estimate: 0h
            Time Spent: 10m

> Vectorization: date column throwing java.lang.UnsupportedOperationException for parquet
> ---------------------------------------------------------------------------------------
>
>                 Key: HIVE-21200
>                 URL: https://issues.apache.org/jira/browse/HIVE-21200
>             Project: Hive
>          Issue Type: Bug
>          Components: Vectorization
>    Affects Versions: 2.3.4
>            Reporter: Vipin Vishvkarma
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> {code:java}
> set hive.vectorized.execution.enabled=true; 
> set hive.fetch.task.conversion=none; 
> create table t1(d date) stored as parquet; 
> insert into table t1 values("2019-01-01"); 
> select * from t1;
> {code}
> This query will throw java.lang.UnsupportedOperationException:
> {code:java}
> Caused by: java.lang.UnsupportedOperationException
>     at org.apache.parquet.column.values.ValuesReader.readLong(ValuesReader.java:131)
>     at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedPrimitiveColumnReader.readLongs(VectorizedPrimitiveColumnReader.java:297)
>     at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedPrimitiveColumnReader.readBatchHelper(VectorizedPrimitiveColumnReader.java:176)
>     at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedPrimitiveColumnReader.readBatch(VectorizedPrimitiveColumnReader.java:154)
>     at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.nextBatch(VectorizedParquetRecordReader.java:292)
>     at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:238)
>     at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:64)
>     at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:365)
> {code}
> The above query works fine with Hive 2.1.1 and Hive 3.1.x.
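
The top frame of the trace is the base `ValuesReader.readLong`, which suggests that the concrete reader chosen for this column did not override the 64-bit read. The sketch below reproduces that failure mode with hypothetical stand-in classes. It does not use the real Parquet or Hive classes, and every name in it is an assumption made for illustration.

```java
import java.time.LocalDate;

public class ReadWidthSketch {
    // Hypothetical stand-in for a values reader base class: every typed
    // read is unsupported unless a subclass overrides it.
    static abstract class SketchValuesReader {
        int readInteger() { throw new UnsupportedOperationException(); }
        long readLong() { throw new UnsupportedOperationException(); }
    }

    // Hypothetical reader for a 32-bit column: only readInteger is overridden.
    static class SketchInt32Reader extends SketchValuesReader {
        private final int[] values;
        private int pos = 0;

        SketchInt32Reader(int... values) { this.values = values; }

        @Override
        int readInteger() { return values[pos++]; }
    }

    public static void main(String[] args) {
        // 17897 is 2019-01-01 expressed as days since 1970-01-01.
        SketchValuesReader reader = new SketchInt32Reader(17897);
        try {
            reader.readLong(); // 64-bit read against a 32-bit column
        } catch (UnsupportedOperationException e) {
            System.out.println("readLong failed: " + e);
        }
        // The 32-bit read succeeds and decodes to a calendar date.
        System.out.println(LocalDate.ofEpochDay(reader.readInteger()));
    }
}
```

In this sketch, reading the date column with the 32-bit method avoids the exception, which is the general shape of the change the pull request describes.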



--
This message was sent by Atlassian Jira
(v8.3.4#803005)