You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "David Mollitor (Jira)" <ji...@apache.org> on 2020/06/12 16:35:00 UTC
[jira] [Commented] (HIVE-20827) Inconsistent results for empty
arrays
[ https://issues.apache.org/jira/browse/HIVE-20827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134363#comment-17134363 ]
David Mollitor commented on HIVE-20827:
---------------------------------------
[~teddy.choi] Possible to make a branch-3 / branch-2 backport?
> Inconsistent results for empty arrays
> -------------------------------------
>
> Key: HIVE-20827
> URL: https://issues.apache.org/jira/browse/HIVE-20827
> Project: Hive
> Issue Type: Bug
> Reporter: Teddy Choi
> Assignee: Teddy Choi
> Priority: Major
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20827.1.patch
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> LazySimpleDeserializeRead parses an empty array wrong. For example, a line ',' in a text file table with a delimiter ',' and schema 'array<int>, array<array<string>>' shows \[null\], \[\[""\]\], instead of \[\], \[\] with MapReduce engine and vectorized execution enabled. LazySimpleDeserializeRead has following code;
> {code:java}
> switch (complexField.complexCategory) {
> case LIST:
> {
> // Allow for empty string, etc.
> final boolean isNext = (fieldPosition <= complexFieldEnd);
> {code}
> Empty string value read should be only applied to string families, not to other data types.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)