Posted to issues@hive.apache.org by "Zoltan Haindrich (Jira)" <ji...@apache.org> on 2021/10/26 10:46:00 UTC
[jira] [Reopened] (HIVE-25553) Support Map data-type natively in Arrow format
[ https://issues.apache.org/jira/browse/HIVE-25553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zoltan Haindrich reopened HIVE-25553:
-------------------------------------
reverted from master:
* it was committed without a clean testrun
* 5 tests were broken by these changes
** one of the tests is clearly Arrow-related (org.apache.hadoop.hive.ql.io.arrow.TestSerializer)
http://ci.hive.apache.org/job/hive-precommit/job/master/lastCompletedBuild/testReport/junit/org.apache.hadoop.hive.ql.io.arrow/TestSerializer/Testing___split_06___PostProcess___testEmptyComplexStruct/
[~sankarh] why did you merge the changes even though the PR was marked as tests-failed? It didn't even have a green testrun!
http://ci.hive.apache.org/job/hive-precommit/job/PR-2689/
> Support Map data-type natively in Arrow format
> ----------------------------------------------
>
> Key: HIVE-25553
> URL: https://issues.apache.org/jira/browse/HIVE-25553
> Project: Hive
> Issue Type: Improvement
> Components: llap, Serializers/Deserializers
> Reporter: Adesh Kumar Rao
> Assignee: Sruthi Mooriyathvariam
> Priority: Major
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Time Spent: 2.5h
> Remaining Estimate: 0h
>
> Currently ArrowColumnarBatchSerDe converts the map data type into a list of structs (where each struct contains one key-value pair of the map). This causes issues when reading the Map data type using llap-ext-client, as it reads a list of structs instead.
> HiveWarehouseConnector, which uses the llap-ext-client, throws an exception when the schema (containing the Map data type) differs from the actual data (list of structs).
>
> Fixing this issue requires upgrading the Arrow version (to one where the map data type is supported) and modifying ArrowColumnarBatchSerDe and the corresponding Serializer/Deserializer to use the Arrow map data type instead of a list as a workaround for map.
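
The mismatch described in the issue can be illustrated with a minimal sketch in plain Java. It uses no Hive or Arrow classes; `MapEncodingSketch`, `KeyValueStruct`, `toListOfStructs`, `describe` and `readMapSize` are hypothetical names made up for this illustration, and the type strings are only labels. It shows a reader that trusts a declared map schema failing when the data arrives as a list of key/value structs.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: plain Java stand-ins, no Hive or Arrow classes are used.
class MapEncodingSketch {

    // Hypothetical stand-in for one {key, value} struct in the workaround encoding.
    static final class KeyValueStruct {
        final String key;
        final int value;

        KeyValueStruct(String key, int value) {
            this.key = key;
            this.value = value;
        }
    }

    // Workaround encoding: each map entry becomes one struct in a list.
    static List<KeyValueStruct> toListOfStructs(Map<String, Integer> map) {
        List<KeyValueStruct> structs = new ArrayList<>();
        for (Map.Entry<String, Integer> entry : map.entrySet()) {
            structs.add(new KeyValueStruct(entry.getKey(), entry.getValue()));
        }
        return structs;
    }

    // Type label a reader would infer from the value it actually received.
    static String describe(Object value) {
        if (value instanceof Map) {
            return "map<string,int>";
        }
        if (value instanceof List) {
            return "array<struct<key:string,value:int>>";
        }
        return "unknown";
    }

    // A reader that trusts the declared schema (map) and rejects anything else.
    static int readMapSize(Object value) {
        if (!(value instanceof Map)) {
            throw new IllegalStateException(
                "schema declares map<string,int> but data is " + describe(value));
        }
        return ((Map<?, ?>) value).size();
    }

    public static void main(String[] args) {
        Map<String, Integer> row = new LinkedHashMap<>();
        row.put("a", 1);
        row.put("b", 2);

        // Native map encoding: declared schema and data agree.
        System.out.println(readMapSize(row));

        // Workaround encoding: the same row arrives as a list of structs.
        try {
            readMapSize(toListOfStructs(row));
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

With a native map encoding the declared schema and the data agree, so the second branch in `main` would not be reached for map columns.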
--
This message was sent by Atlassian Jira
(v8.3.4#803005)