You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Mark Bathori (Jira)" <ji...@apache.org> on 2021/11/10 09:02:00 UTC
[jira] [Updated] (HIVE-25686) UDFSpace functionality issue after HADOOP-17901 and HADOOP-17905
[ https://issues.apache.org/jira/browse/HIVE-25686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mark Bathori updated HIVE-25686:
--------------------------------
Description:
The changes added in HADOOP-17901 and HADOOP-17905 introduced some issues in Hive side. UDFSpace is generating false output because the text length is not determined correctly. It causing unexpected character in the result.
*Repro steps:*
{code:java}
create table t(i int);{code}
{code:java}
insert into t values (5),(6),(7),(8),(9),(10),(11),(12),(13),(14);{code}
{code:java}
select i, SPACE(i) from t;{code}
Faulty output:
{code:java}
+-----+-----------------+
| i | _c1 |
+-----+-----------------+
| 5 | |
| 6 | |
| 7 | |
| 8 | |
| 9 | |
| 10 | |
| 11 | |
| 12 | |
| 13 | |
| 14 | |
+-----+-----------------+
{code}
Also by using --outputformat=csv the generated file contains 0 ascii characters(“EOF”) instead of spaces.
was:
The changes added in HADOOP-17901 and HADOOP-17905 ** introduced some issues in Hive side. UDFSpace is generating false output because the text length is not determined correctly. It causing unexpected character in the result.
*Repro steps:*
{code:java}
create table t(i int);{code}
{code:java}
insert into t values (5),(6),(7),(8),(9),(10),(11),(12),(13),(14);{code}
{code:java}
select i, SPACE(i) from t;{code}
Faulty output:
{code:java}
+-----+-----------------+
| i | _c1 |
+-----+-----------------+
| 5 | |
| 6 | |
| 7 | |
| 8 | |
| 9 | |
| 10 | |
| 11 | |
| 12 | |
| 13 | |
| 14 | |
+-----+-----------------+
{code}
Also by using --outputformat=csv the generated file contains 0 ascii characters(“EOF”) instead of spaces.
> UDFSpace functionality issue after HADOOP-17901 and HADOOP-17905
> ----------------------------------------------------------------
>
> Key: HIVE-25686
> URL: https://issues.apache.org/jira/browse/HIVE-25686
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Reporter: Mark Bathori
> Priority: Critical
> Fix For: 4.0.0
>
>
> The changes added in HADOOP-17901 and HADOOP-17905 introduced some issues in Hive side. UDFSpace is generating false output because the text length is not determined correctly. It causing unexpected character in the result.
> *Repro steps:*
> {code:java}
> create table t(i int);{code}
> {code:java}
> insert into t values (5),(6),(7),(8),(9),(10),(11),(12),(13),(14);{code}
> {code:java}
> select i, SPACE(i) from t;{code}
>
> Faulty output:
> {code:java}
> +-----+-----------------+
> | i | _c1 |
> +-----+-----------------+
> | 5 | |
> | 6 | |
> | 7 | |
> | 8 | |
> | 9 | |
> | 10 | |
> | 11 | |
> | 12 | |
> | 13 | |
> | 14 | |
> +-----+-----------------+
> {code}
>
> Also by using --outputformat=csv the generated file contains 0 ascii characters(“EOF”) instead of spaces.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)