You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Ashutosh Chauhan (JIRA)" <ji...@apache.org> on 2016/03/23 13:09:25 UTC
[jira] [Commented] (HIVE-12653) The property
"serialization.encoding" in the class
"org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe" does not work
[ https://issues.apache.org/jira/browse/HIVE-12653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208309#comment-15208309 ]
Ashutosh Chauhan commented on HIVE-12653:
-----------------------------------------
+1
> The property "serialization.encoding" in the class "org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe" does not work
> ---------------------------------------------------------------------------------------------------------------------------
>
> Key: HIVE-12653
> URL: https://issues.apache.org/jira/browse/HIVE-12653
> Project: Hive
> Issue Type: Improvement
> Components: Contrib
> Affects Versions: 1.2.1
> Reporter: yangfang
> Assignee: yangfang
> Attachments: HIVE-12653.2.patch, HIVE-12653.3.patch, HIVE-12653.patch, HIVE-12653.patch
>
>
> when I create table with ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe' and load some files with chinese encoded by GBK:
> create table PersonInfo (cod_fn_ent string, num_seq_trc_form string, date_tr string,
> num_jrn_no string, cod_trc_form_typ string,id_intl_ip string, name string )
> ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.MultiDelimitSerDe'
> WITH SERDEPROPERTIES ("field.delim"="|!","serialization.encoding"='GBK');
> load data local inpath '/home/mr/hive/99-BoEing-IF_PMT_NOTE-2G-20151019-00000' overwrite into table PersonInfo;
> I found chinese disorder code in the table and 'serialization.encoding' does not work, the chinese disorder data list as below:
> | ���� 99999999�ϴ����������� 0624624002��ʱ����������
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)