Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2009/02/05 20:19:59 UTC

[jira] Created: (HIVE-274) User cannot specify schema of the reduce output, only the column names can be specified

User cannot specify schema of the reduce output, only the column names can be specified
---------------------------------------------------------------------------------------

                 Key: HIVE-274
                 URL: https://issues.apache.org/jira/browse/HIVE-274
             Project: Hadoop Hive
          Issue Type: New Feature
          Components: Query Processor
            Reporter: Namit Jain



Consider the test: mapreduce1.q:

FROM src
INSERT OVERWRITE TABLE dest1
MAP src.key, CAST(src.key / 10 AS INT), CAST(src.key % 10 AS INT), src.value
USING '/bin/cat' AS (tkey, ten, one, tvalue)
DISTRIBUTE BY tvalue, tkey
SORT BY ten, one;


Although the MAP input casts two of the columns (ten, one) to INT, the output of the script ('/bin/cat') is still treated as strings, so SORT BY compares ten and one as strings.
Even if the user wanted integer ordering, the output is sorted lexicographically (see mapreduce1.q.out).
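
To illustrate the difference between the two orderings, here is a minimal Python sketch. It is not Hive code, and the sample values are made up for illustration; it only assumes that a streaming script such as /bin/cat hands its columns back as plain text, as described above.

```python
# Hypothetical sample values for the "ten" column, as plain text,
# the way a streaming script such as /bin/cat would emit them.
ten_as_text = ["10", "9", "2", "1", "0"]

# Untyped columns are compared as strings, character by character.
lexicographic = sorted(ten_as_text)

# Columns typed as integers would be compared by numeric value.
numeric = sorted(ten_as_text, key=int)

print(lexicographic)  # ['0', '1', '10', '2', '9']
print(numeric)        # ['0', '1', '2', '9', '10']
```

In the string ordering "10" lands between "1" and "2", which is the kind of result the issue describes.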


In the AS clause, the user should be able to specify the schema (column names and types) in the same way as in CREATE TABLE.




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (HIVE-274) User cannot specify schema of the script output, only the column names can be specified

Posted by "Zheng Shao (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zheng Shao resolved HIVE-274.
-----------------------------

    Resolution: Fixed

This was fixed together with HIVE-708.



[jira] Updated: (HIVE-274) User cannot specify schema of the script output, only the column names can be specified

Posted by "Zheng Shao (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zheng Shao updated HIVE-274:
----------------------------

    Description: 
Consider the test: mapreduce1.q:

FROM src
INSERT OVERWRITE TABLE dest1
MAP src.key, CAST(src.key / 10 AS INT), CAST(src.key % 10 AS INT), src.value
USING '/bin/cat' AS (tkey, ten, one, tvalue)
DISTRIBUTE BY tvalue, tkey
SORT BY ten, one;


Although the MAP input casts two of the columns (ten, one) to INT, the output of the script ('/bin/cat') is still treated as strings, so SORT BY compares ten and one as strings.
Even if the user wanted integer ordering, the output is sorted lexicographically (see mapreduce1.q.out).


In the AS clause, the user should be able to specify the schema (column names and types) in the same way as in CREATE TABLE.




        Summary: User cannot specify schema of the script output, only the column names can be specified  (was: User cannot specify schema of the reduce output, only the column names can be specified)
