You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2010/01/01 02:44:29 UTC

[jira] Updated: (HIVE-1023) typedbytes: datatypes should be derived from data

     [ https://issues.apache.org/jira/browse/HIVE-1023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1023:
-----------------------------

    Fix Version/s: 0.5.0
           Status: Patch Available  (was: Open)

> typedbytes: datatypes should be derived from data
> -------------------------------------------------
>
>                 Key: HIVE-1023
>                 URL: https://issues.apache.org/jira/browse/HIVE-1023
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.5.0
>
>         Attachments: hive.1023.1.patch
>
>
> FROM (
> FROM src
> SELECT TRANSFORM(src.key, src.value) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.TypedBytesSerDe'
> RECORDWRITER 'org.apache.hadoop.hive.contrib.util.typedbytes.TypedBytesRecordWriter'
> USING '/bin/cat'
> AS (tkey, tvalue) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.TypedBytesSerDe'
> RECORDREADER 'org.apache.hadoop.hive.contrib.util.typedbytes.TypedBytesRecordReader'
> ) tmap
> INSERT OVERWRITE TABLE dest1 SELECT tkey, tvalue;
> The output is interpreted as a string - however, it is assumed that the script is retuning string data.
> It would be useful if the reader and the deserializer can be decoupled.
> The record reader (TypedBytesRecordReader) will read the typed data (independent of the output schema)
> and then convert it according to the output schema. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.