You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Mark Bathori (Jira)" <ji...@apache.org> on 2021/11/10 09:02:00 UTC

[jira] [Updated] (HIVE-25686) UDFSpace functionality issue after HADOOP-17901 and HADOOP-17905

     [ https://issues.apache.org/jira/browse/HIVE-25686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mark Bathori updated HIVE-25686:
--------------------------------
    Description: 
The changes added in HADOOP-17901 and HADOOP-17905 introduced some issues in Hive side. UDFSpace is generating false output because the text length is not determined correctly. It causing unexpected character in the result.

*Repro steps:*
{code:java}
create table t(i int);{code}
{code:java}
insert into t values (5),(6),(7),(8),(9),(10),(11),(12),(13),(14);{code}
{code:java}
select i, SPACE(i) from t;{code}
 
Faulty output:
{code:java}
+-----+-----------------+
|  i  |       _c1       |
+-----+-----------------+
| 5   |                 |
| 6   |                 |
| 7   |                |
| 8   |                 |
| 9   |                |
| 10  |               |
| 11  |                 |
| 12  |                |
| 13  |               |
| 14  |              |
+-----+-----------------+
{code}
 

Also by using --outputformat=csv the generated file contains 0 ascii characters(“EOF”) instead of spaces.

  was:
The changes added in HADOOP-17901 and HADOOP-17905 ** introduced some issues in Hive side. UDFSpace is generating false output because the text length is not determined correctly. It causing unexpected character in the result.

*Repro steps:*
{code:java}
create table t(i int);{code}
{code:java}
insert into t values (5),(6),(7),(8),(9),(10),(11),(12),(13),(14);{code}
{code:java}
select i, SPACE(i) from t;{code}
 
Faulty output:
{code:java}
+-----+-----------------+
|  i  |       _c1       |
+-----+-----------------+
| 5   |                 |
| 6   |                 |
| 7   |                |
| 8   |                 |
| 9   |                |
| 10  |               |
| 11  |                 |
| 12  |                |
| 13  |               |
| 14  |              |
+-----+-----------------+
{code}
 

Also by using --outputformat=csv the generated file contains 0 ascii characters(“EOF”) instead of spaces.


> UDFSpace functionality issue after HADOOP-17901 and HADOOP-17905
> ----------------------------------------------------------------
>
>                 Key: HIVE-25686
>                 URL: https://issues.apache.org/jira/browse/HIVE-25686
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>            Reporter: Mark Bathori
>            Priority: Critical
>             Fix For: 4.0.0
>
>
> The changes added in HADOOP-17901 and HADOOP-17905 introduced some issues in Hive side. UDFSpace is generating false output because the text length is not determined correctly. It causing unexpected character in the result.
> *Repro steps:*
> {code:java}
> create table t(i int);{code}
> {code:java}
> insert into t values (5),(6),(7),(8),(9),(10),(11),(12),(13),(14);{code}
> {code:java}
> select i, SPACE(i) from t;{code}
>  
> Faulty output:
> {code:java}
> +-----+-----------------+
> |  i  |       _c1       |
> +-----+-----------------+
> | 5   |                 |
> | 6   |                 |
> | 7   |                |
> | 8   |                 |
> | 9   |                |
> | 10  |               |
> | 11  |                 |
> | 12  |                |
> | 13  |               |
> | 14  |              |
> +-----+-----------------+
> {code}
>  
> Also by using --outputformat=csv the generated file contains 0 ascii characters(“EOF”) instead of spaces.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)