You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "David Mollitor (Jira)" <ji...@apache.org> on 2020/06/12 16:35:00 UTC

[jira] [Commented] (HIVE-20827) Inconsistent results for empty arrays

    [ https://issues.apache.org/jira/browse/HIVE-20827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134363#comment-17134363 ] 

David Mollitor commented on HIVE-20827:
---------------------------------------

[~teddy.choi] Possible to make a branch-3 / branch-2 backport?

> Inconsistent results for empty arrays
> -------------------------------------
>
>                 Key: HIVE-20827
>                 URL: https://issues.apache.org/jira/browse/HIVE-20827
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Teddy Choi
>            Assignee: Teddy Choi
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0
>
>         Attachments: HIVE-20827.1.patch
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> LazySimpleDeserializeRead parses an empty array wrong. For example, a line ',' in a text file table with a delimiter ',' and schema 'array<int>, array<array<string>>' shows \[null\], \[\[""\]\], instead of \[\], \[\] with MapReduce engine and vectorized execution enabled. LazySimpleDeserializeRead has following code; 
> {code:java}
> switch (complexField.complexCategory) {
> case LIST:
>   {
>     // Allow for empty string, etc.
>     final boolean isNext = (fieldPosition <= complexFieldEnd);
> {code}
> Empty string value read should be only applied to string families, not to other data types. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)