You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Joseph Crotty (JIRA)" <ji...@apache.org> on 2015/10/20 20:01:27 UTC

[jira] [Updated] (SQOOP-2628) Import MySQL table --direct UTF-8 data corrupted

     [ https://issues.apache.org/jira/browse/SQOOP-2628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joseph Crotty updated SQOOP-2628:
---------------------------------
    Summary: Import MySQL table --direct UTF-8 data corrupted  (was: Import MySQL table UTF-8 data --direct )

> Import MySQL table --direct UTF-8 data corrupted
> ------------------------------------------------
>
>                 Key: SQOOP-2628
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2628
>             Project: Sqoop
>          Issue Type: Bug
>          Components: sqoop2-jdbc-connector
>    Affects Versions: 1.4.6
>         Environment: sqoop 1.4.6 hadoop 2.6.0-amzn-1
>            Reporter: Joseph Crotty
>         Attachments: create_utest_table.sql, sqoop_import.sh, sqoop_utest.log
>
>
> Sqoop doesn't honor UTF-8 chars when import --direct on a MySQL table.
> Here is the key comma delimited output from attached example script w/o and w/ --direct:
> {code}
> 1,Τη γλώσσα,"/fox/\jumps
> 1,���� ������������,"/fox/\jumps
> {code}
> I looked over sqoop --verbose output and hadoop logs but can't find anything suspicious.
> Attached are scripts to create the MySQL utest example table and bash script I used to demonstrate the --direct problem.
> Environment
> {code}
> $ sqoop version
> Warning: /home/hadoop/sqoop-1.4.6.bin__hadoop-2.0.4-alpha/bin/../../hcatalog does not exist! HCatalog jobs will fail.
> Please set $HCAT_HOME to the root of your HCatalog installation.
> Warning: /home/hadoop/sqoop-1.4.6.bin__hadoop-2.0.4-alpha/bin/../../accumulo does not exist! Accumulo imports will fail.
> Please set $ACCUMULO_HOME to the root of your Accumulo installation.
> 15/10/20 17:28:21 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6
> Sqoop 1.4.6
> git commit id c0c5a81723759fa575844a0a1eae8f510fa32c25
> Compiled by root on Mon Apr 27 14:38:36 CST 2015
> $ hadoop version
> Hadoop 2.6.0-amzn-1
> Subversion git@aws157git.com:/pkg/Aws157BigTop -r edd5a97db145470a8723dde24f38c83724e0959c
> Compiled by ec2-user on 2015-09-25T14:59Z
> Compiled with protoc 2.5.0
> From source with checksum 7beeae31f3c4554b23d92f1e63dc85
> This command was run using /usr/lib/hadoop/hadoop-common-2.6.0-amzn-1.jar
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)