Posted to dev@sqoop.apache.org by "sutao bian (JIRA)" <ji...@apache.org> on 2013/05/21 11:37:16 UTC

[jira] [Comment Edited] (SQOOP-1047) where i import data from oracle to hdfs useing the --fields-terminated-by '\001' sqoop use null split the data.

    [ https://issues.apache.org/jira/browse/SQOOP-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13662837#comment-13662837 ] 

sutao bian edited comment on SQOOP-1047 at 5/21/13 9:35 AM:
------------------------------------------------------------

I think Sqoop uses the default separator when I specify '\001' as the separator.

My database: Oracle 11g


Some sample data:

"   ","SMIN_INFOID","SMIN_COOPID_FK","SMIN_CODE","SMIN_CONTENT","SMIN_WHITELIST","SMIN_INFOTYPE_FK","SMIN_PROVINCE","SMIN_COUNTRY","SMIN_VALUE","SMIN_CHECKRATE","SMIN_CONFIRMKEYWORD","SMIN_NEEDCONFIRM","SMIN_INFONAME","SMIN_SECONDPROVINCE","SMIN_SECONDCODE","SMIN_SECONDCONTENT","SMIN_CUTYID","SMIN_CANCELNUMBER","SMIN_CANCELCONTENT","SMIN_QINCOINVALUE","SMIN_STATUS","SMIN_MOMTSENT","SMIN_USERNAME","SMIN_PASSWORD","SMIN_PRESENTCOIN","SMIN_LIMITDAYVALUE","SMIN_LIMITMONTHVALUE","SMIN_MCC","SMIN_DAILYMTLIMIT","SMIN_GROUPID","SMIN_COMPANY_NAME","SMIN_BUSI_NAME","SMIN_SERVICE_PHONE","SMIN_CONTENT_PRECISE","SMIN_SEPERATOR","SMIN_OSU_CODETEMPLATE","SMIN_IMPORTANT_INFO"
"1","73","71","106610887","312","46000,46002","4","","","0.00","0.71","","0","abc","","","","1","","","","1","0","","","0","","","460","","","","","","","-","",""
"2","74","71","106610881","C","45000,46000,46001,xxx,46002,46003,-1","1","","","1.00","0.71","aaa","0","aaaa","","","","1","","","200.00","1","0","","","0","","","460","","","aaaa","bbbb","4006867616","C","-","",""
"3","76","71","10661088","MK","45000,46000,46001,xxx,46002,46003,-1","2","","","8.00","0.71","bbb","1","bbbbb","-1","Unknown","Y","1","10661088","0000","","1","0","","","0","","","460","","","","","","","-","",""


Thanks again. 

                
      was (Author: biansutao):
    
I think Sqoop uses the default separator when I specify '\001' as the separator.

My database: Oracle 11g


Some sample data:

"   ","SMIN_INFOID","SMIN_COOPID_FK","SMIN_CODE","SMIN_CONTENT","SMIN_WHITELIST","SMIN_INFOTYPE_FK","SMIN_PROVINCE","SMIN_COUNTRY","SMIN_VALUE","SMIN_CHECKRATE","SMIN_CONFIRMKEYWORD","SMIN_NEEDCONFIRM","SMIN_INFONAME","SMIN_SECONDPROVINCE","SMIN_SECONDCODE","SMIN_SECONDCONTENT","SMIN_CUTYID","SMIN_CANCELNUMBER","SMIN_CANCELCONTENT","SMIN_QINCOINVALUE","SMIN_STATUS","SMIN_MOMTSENT","SMIN_USERNAME","SMIN_PASSWORD","SMIN_PRESENTCOIN","SMIN_LIMITDAYVALUE","SMIN_LIMITMONTHVALUE","SMIN_MCC","SMIN_DAILYMTLIMIT","SMIN_GROUPID","SMIN_COMPANY_NAME","SMIN_BUSI_NAME","SMIN_SERVICE_PHONE","SMIN_CONTENT_PRECISE","SMIN_SEPERATOR","SMIN_OSU_CODETEMPLATE","SMIN_IMPORTANT_INFO"
"1","73","71","106610887","312","46000,46002","4","","","0.00","0.71","","0","abc","","","","1","","","","1","0","","","0","","","460","","","","","","","-","",""
"2","74","71","106610881","C","45000,46000,46001,xxx,46002,46003,-1","1","","","1.00","0.71","aaa","0","aaaa","","","","1","","","200.00","1","0","","","0","","","460","","","aaaa","bbbb","4006867616","C","-","",""
"3","76","71","10661088","MK","45000,46000,46001,xxx,46002,46003,-1","2","","","8.00","0.71","bbb","1","bbbbb","-1","Unknown","Y","1","10661088","0000","","1","0","","","0","","","460","","","","","","","-","",""

---------------
I tried the --null-string and --null-non-string arguments. They change every null value, not only the separator.
73^A71^A106610887^A312^A46000,46002^A4^A^A^A^A^A0^A0.71^A^A^A0^Aaaaaa^A^A^A^A^A^A^A1^A^A^A^A^A^A^A1^A0^A^A^A^A^A0^A^A^A^A^A460^A^A^A^A^A^A^A^A^A^A^A^A^A-^A^A^A^A
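
To illustrate the distinction between the field separator and the null placeholder, here is a minimal Python sketch. It is not Sqoop's implementation: format_record and the sample row are hypothetical, and the only assumptions taken from this thread are that the separator is the 0x01 byte and that NULL columns are written as the text "null" unless another null string is chosen.

```python
from typing import Optional, Sequence

# The byte selected by --fields-terminated-by '\001'.
FIELD_DELIMITER = "\x01"


def format_record(fields: Sequence[Optional[str]], null_string: str = "null") -> str:
    """Join one row; NULL columns become null_string (hypothetical helper)."""
    return FIELD_DELIMITER.join(null_string if f is None else f for f in fields)


# Hypothetical row with two NULL columns.
row = ["73", "71", None, "0.71", None]

default_line = format_record(row)                     # NULL written as "null"
empty_null_line = format_record(row, null_string="")  # NULL written as ""

# Splitting on the 0x01 byte recovers five fields in both cases;
# only the placeholder used for NULL differs.
print(default_line.split(FIELD_DELIMITER))
print(empty_null_line.split(FIELD_DELIMITER))
```

In this sketch the null string and the separator are independent settings, which matches the observation above that the null arguments affect every null value rather than the separator.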





Thanks again. 

                  
> where i import data from oracle to hdfs  useing the  --fields-terminated-by '\001'   sqoop use null split the data.
> -------------------------------------------------------------------------------------------------------------------
>
>                 Key: SQOOP-1047
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1047
>             Project: Sqoop
>          Issue Type: Bug
>         Environment: Sqoop 1.4.2  oracle 
>            Reporter: sutao bian
>
> The output I got is split by null, not by \001. The command and its output:
> /home/hadoop/sqoop-1.4.2.bin__hadoop-0.20/bin/sqoop import --verbose --fields-terminated-by '\001' --connect jdbc:oracle:thin:@192.168.2.226:1521:db --username USERETL --password pass --as-textfile --split-by SMIN_INFOID --table USERETL.TBL_SMSINFO --target-dir /tmp/bbb
> )5.0ARSnullnullnull27nullnull20011nullnullnullnullnull722nullnullMopaynullnullnullSPACEnullnull
> 90921314004netqin20408,20416,204031nullnull1.5nullnullnullMopay NL(荷兰)1.5EURnullnullnull3nullnull20010nullnullnullnullnull204nullnullMopaynullnullnullSPACEnullnull
> 916539106610797M46000,46002,460072null中国5null*手机安全攻略*null中国移动-手机安全攻略5元包月nullnullnull1106610797QXAQGL20010testtestnullnullnull460nullnull中国移动手机安全攻略nullM_nullnull
> 9432068109276NQMS51011,510211null印尼5500nullnullnullCentili印尼短号MS5500盾9276XLnullnullnull10nullnull20010nullnull0nullnull510nullnullCentili XLnullnullnullSPACEnullnull
> 97620521035960BSTNGMTNMTCM026621301null尼日利亚100nullnullnull黑鲨-尼日利亚100N(MTN)nullnullnull42nullnull20011nullnull0nullnull621nullnull黑鲨nullnullnullSPACEnullnull
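
One way to check whether the 0x01 byte is actually present in output like the above is to look at the raw bytes rather than at what the screen shows, since a display may not render that byte at all. The Python sketch below is only an illustration under that assumption: the sample bytes are made up, not taken from the reporter's file, and both helper names are hypothetical.

```python
def strip_control_chars(raw: bytes) -> str:
    """Mimic a display that silently drops non-printable characters (an assumption)."""
    return "".join(ch for ch in raw.decode("utf-8") if ch.isprintable())


def caret_notation(raw: bytes) -> str:
    """Render control characters below 0x20 in caret notation, so 0x01 becomes ^A."""
    out = []
    for ch in raw.decode("utf-8"):
        code = ord(ch)
        out.append("^" + chr(code + 64) if code < 32 else ch)
    return "".join(out)


# Hypothetical record: four fields joined by the 0x01 byte, one of them the text "null".
sample = "1\x01abc\x01null\x01460".encode("utf-8")

print(strip_control_chars(sample))  # fields appear to run together
print(caret_notation(sample))       # the separators become visible
print(sample.count(b"\x01"))        # number of separator bytes in the record
```

If the count of 0x01 bytes in a real output line equals the number of columns minus one, the separator is being written and only the display is hiding it; if the count is zero, the separator is genuinely missing.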
