You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Eric Lin (JIRA)" <ji...@apache.org> on 2017/05/07 11:52:04 UTC

[jira] [Commented] (SQOOP-3134) Add option to configure Avro schema output file name with (import + --as-avrodatafile)

    [ https://issues.apache.org/jira/browse/SQOOP-3134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15999808#comment-15999808 ] 

Eric Lin commented on SQOOP-3134:
---------------------------------

[~markuskemper@me.com],

Is there any specific reason you want to do this? I can check if I can get this implemented. But I would like understand the intension.

Thanks

> Add option to configure Avro schema output file name with (import + --as-avrodatafile) 
> ---------------------------------------------------------------------------------------
>
>                 Key: SQOOP-3134
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3134
>             Project: Sqoop
>          Issue Type: Improvement
>            Reporter: Markus Kemper
>
> Please consider adding an option to configure the Avro schema output file name that is created with Sqoop (import + --as-avrodatafile), example cases below.
> {noformat}
> #################
> # STEP 01 - Create Data
> #################
> export MYCONN=jdbc:mysql://mysql.cloudera.com:3306/db_coe
> export MYUSER=sqoop
> export MYPSWD=cloudera
> sqoop list-tables --connect $MYCONN --username $MYUSER --password $MYPSWD
> sqoop eval --connect $MYCONN --username $MYUSER --password $MYPSWD --query "drop table t1"
> sqoop eval --connect $MYCONN --username $MYUSER --password $MYPSWD --query "create table t1 (c1 int, c2 date, c3 varchar(10))"
> sqoop eval --connect $MYCONN --username $MYUSER --password $MYPSWD --query "insert into t1 values (1, current_date, 'some data')"
> sqoop eval --connect $MYCONN --username $MYUSER --password $MYPSWD --query "select * from t1"
> -----------------------------------------
> | c1          | c2         | c3         | 
> -----------------------------------------
> | 1           | 2017-02-13 | some data  | 
> -----------------------------------------
> #################
> # STEP 02 - Import + --table + --as-avrodatafile
> #################
> sqoop import --connect $MYCONN --username $MYUSER --password $MYPSWD --table t1 --target-dir /user/root/t1 --delete-target-dir --num-mappers 1 --as-avrodatafile 
> ls -l ./*
> Output:
> 17/02/13 12:14:52 INFO mapreduce.ImportJobBase: Transferred 413 bytes in 20.6988 seconds (19.9529 bytes/sec)
> 17/02/13 12:14:52 INFO mapreduce.ImportJobBase: Retrieved 1 records.
> ~~~~
> -rw-r--r-- 1 root root   492 Feb 13 12:14 ./t1.avsc <==== want option to configure this file name
> -rw-r--r-- 1 root root 12462 Feb 13 12:14 ./t1.java
> #################
> # STEP 03 - Import + --query + --as-avrodatafile
> #################
> sqoop import --connect $MYCONN --username $MYUSER --password $MYPSWD --query "select * from t1 where \$CONDITIONS" --split-by c1 --target-dir /user/root/t1 --delete-target-dir --num-mappers 1 --as-avrodatafile 
> ls -l ./*
> Output:
> 17/02/13 12:16:58 INFO mapreduce.ImportJobBase: Transferred 448 bytes in 25.2757 seconds (17.7245 bytes/sec)
> 17/02/13 12:16:58 INFO mapreduce.ImportJobBase: Retrieved 1 records.
> ~~~~~
> -rw-r--r-- 1 root root   527 Feb 13 12:16 ./AutoGeneratedSchema.avsc <==== want option to configure this file name
> -rw-r--r-- 1 root root 12590 Feb 13 12:16 ./QueryResult.java
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)