You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Anthony Hsu (JIRA)" <ji...@apache.org> on 2016/12/13 23:01:00 UTC
[jira] [Assigned] (HIVE-6365) Alter a partition to be of a
different fileformat than the Table's fileformat. Use insert overwrite to
write data to this partition. The partition fileformat is converted back to
table's fileformat after the insert operation.
[ https://issues.apache.org/jira/browse/HIVE-6365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Anthony Hsu reassigned HIVE-6365:
---------------------------------
Assignee: Anthony Hsu
> Alter a partition to be of a different fileformat than the Table's fileformat. Use insert overwrite to write data to this partition. The partition fileformat is converted back to table's fileformat after the insert operation.
> ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HIVE-6365
> URL: https://issues.apache.org/jira/browse/HIVE-6365
> Project: Hive
> Issue Type: Bug
> Environment: emr
> Reporter: Pavan Srinivas
> Assignee: Anthony Hsu
>
> Lets say, there is partitioned table like
> Step1:
> >> CREATE TABLE srcpart (key STRING, value STRING)
> PARTITIONED BY (ds STRING, hr STRING)
> STORED AS TEXTFILE;
> Step2:
> Alter the fileformat for a specific available partition.
> >> alter table srcpart partition(ds="2008-04-08", hr="12") set fileformat orc;
> Step3:
> Describe the partition.
> >> desc formatted srcpart partition(ds="2008-04-08", hr="12")
> .............
> # Storage Information
> SerDe Library: org.apache.hadoop.hive.ql.io.orc.OrcSerde
> InputFormat: org.apache.hadoop.hive.ql.io.orc.OrcInputFormat
> OutputFormat: org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat
> Compressed: No
> Num Buckets: -1
> Bucket Columns: []
> Sort Columns: []
> Storage Desc Params:
> serialization.format 1
> Step4:
> Write the data to this partition using insert overwrite.
> >>insert overwrite table srcpart partition(ds="2008-04-08",hr="12") select key, value from .......
> Step5:
> Describe the partition again.
> >> desc formatted srcpart partition(ds="2008-04-08", hr="12")
> .................
> # Storage Information
> SerDe Library: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
> InputFormat: org.apache.hadoop.mapred.TextInputFormat
> OutputFormat: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
> Compressed: No
> Num Buckets: -1
> Bucket Columns: []
> Sort Columns: []
> Storage Desc Params:
> serialization.format 1
> The fileformat of the partition is converted back to the table's original fileformat. It should have retained and written the data in the modified fileformat.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)