Posted to user@sqoop.apache.org by Buntu Dev <bu...@gmail.com> on 2016/08/16 19:56:28 UTC

Sqoop with --as-parquetfile results in "org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern" error

I'm trying to import a MySQL table into Hive (as Parquet) using this
sqoop import command:

~~~~~~~~~~~~~~~~~~~~~~~~
sqoop import \
 --warehouse-dir /usr/hive/wh/my.db \
 --table src \
 --hive-import \
 --hive-database usr/hive/wh/my.db \
 --hive-table dest \
 --as-parquetfile \
 --connect jdbc:mysql://<dbhost>:<port>/<db> \
 --username <username> \
 --password <password>
~~~~~~~~~~~~~~~~~~~~~~~~

But I end up running into this error:

~~~~~~~~
org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern: dataset:hive:/usr/hive/wh/my.db/dest
Check that JARs for hive datasets are on the classpath
at org.kitesdk.data.spi.Registration.lookupDatasetUri(Registration.java:128)
at org.kitesdk.data.Datasets.exists(Datasets.java:624)
at org.kitesdk.data.Datasets.exists(Datasets.java:646)
....
~~~~~~~~~~
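
For reference, the URI in the error appears to be assembled from the values I passed to --hive-database and --hive-table. The sketch below only reproduces that observation; the "dataset:hive:/<database>/<table>" layout is inferred from the error message, not confirmed from the Sqoop or Kite source, and the variable names are made up for illustration.

```shell
#!/bin/sh
# Values copied from the sqoop import command above.
hive_database="usr/hive/wh/my.db"   # value passed to --hive-database
hive_table="dest"                   # value passed to --hive-table

# Assumption: the URI has the form dataset:hive:/<database>/<table>.
# This layout is inferred from the error message only.
uri="dataset:hive:/${hive_database}/${hive_table}"

echo "$uri"
```

If that inference holds, the database part of the URI is whatever string was given to --hive-database, path separators included.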

Are we expected to create the dataset before running the Sqoop import, or is
this related to some Hive JARs missing from the classpath?


Thanks!