Posted to dev@sqoop.apache.org by "Joseph Crotty (JIRA)" <ji...@apache.org> on 2015/10/20 19:54:27 UTC

[jira] [Created] (SQOOP-2628) Import MySQL table UTF-8 data --direct

Joseph Crotty created SQOOP-2628:
------------------------------------

             Summary: Import MySQL table UTF-8 data --direct 
                 Key: SQOOP-2628
                 URL: https://issues.apache.org/jira/browse/SQOOP-2628
             Project: Sqoop
          Issue Type: Bug
          Components: sqoop2-jdbc-connector
    Affects Versions: 1.4.6
         Environment: {code}
$ sqoop version
Warning: /home/hadoop/sqoop-1.4.6.bin__hadoop-2.0.4-alpha/bin/../../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /home/hadoop/sqoop-1.4.6.bin__hadoop-2.0.4-alpha/bin/../../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
15/10/20 17:28:21 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6
Sqoop 1.4.6
git commit id c0c5a81723759fa575844a0a1eae8f510fa32c25
Compiled by root on Mon Apr 27 14:38:36 CST 2015

$ hadoop version
Hadoop 2.6.0-amzn-1
Subversion git@aws157git.com:/pkg/Aws157BigTop -r edd5a97db145470a8723dde24f38c83724e0959c
Compiled by ec2-user on 2015-09-25T14:59Z
Compiled with protoc 2.5.0
From source with checksum 7beeae31f3c4554b23d92f1e63dc85
This command was run using /usr/lib/hadoop/hadoop-common-2.6.0-amzn-1.jar
{code}
            Reporter: Joseph Crotty


Sqoop doesn't preserve UTF-8 characters when importing a MySQL table with --direct.

Here is a snippet of comma-delimited output from an example script, first without --direct and then with --direct:
1,Τη γλώσσα,"/fox/\jumps
1,���� ������������,"/fox/\jumps
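
The garbled line appears to contain one replacement character (U+FFFD) per UTF-8 byte of the original Greek text. The sketch below reproduces that pattern under one assumption: that somewhere in the --direct path the UTF-8 bytes are decoded as ASCII, with invalid bytes replaced. That is a guess at the mechanism, not a confirmed diagnosis of what Sqoop does, and the variable names are illustrative only.

```python
# Illustration only: ASSUMES the corruption is equivalent to decoding
# UTF-8 bytes with an ASCII codec that replaces invalid bytes.
original = "Τη γλώσσα"

# Every Greek letter in this string takes two bytes in UTF-8;
# the space takes one.
utf8_bytes = original.encode("utf-8")

# Each byte >= 0x80 is invalid ASCII and becomes one U+FFFD.
garbled = utf8_bytes.decode("ascii", errors="replace")

# Split on the space to compare the two runs of replacement characters.
first_run, second_run = garbled.split(" ")

print(len(original), "characters,", len(utf8_bytes), "bytes")
print(len(first_run), "and", len(second_run), "replacement characters")
```

If the run lengths printed here match the run lengths in the --direct output above, that is consistent with a character-set mismatch on the --direct path rather than with data being corrupted in MySQL itself.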

I looked over the sqoop --verbose output and the Hadoop logs but couldn't find anything suspicious.

Attached are scripts to create the MySQL utest example table, along with the bash script I used to demonstrate the --direct problem.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)