You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Virendhar Sivaraman (JIRA)" <ji...@apache.org> on 2016/02/17 11:49:18 UTC

[jira] [Commented] (SQOOP-1055) Add option to export from Hive use HQL query

    [ https://issues.apache.org/jira/browse/SQOOP-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15150275#comment-15150275 ] 

Virendhar Sivaraman commented on SQOOP-1055:
--------------------------------------------

This will be a handy feature to have. Currently I am creating a staging table in hive using a hive query then exporting this hive table to mysql.

> Add option to export from Hive use HQL query
> --------------------------------------------
>
>                 Key: SQOOP-1055
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1055
>             Project: Sqoop
>          Issue Type: Improvement
>            Reporter: Hari Sekhon
>
> Sqoop currently has a --query option for import but not for export.
> It would be nice if the export --query option supporting HiveQL could be added as users currently have to create a temporary table and then export that as a two step process with a full disk re-write of all the to-be-exported data to a new table before the sqoop export command is started.
> Since Sqoop executes a distributed map-only job, I believe certain queries such as joins that have to be done via a reduce phase will yield little performance improvement due to the map->reduce intermediate writes needing to be written anyway. However we could save on the final reduce phase writes and also turn this in to a more convenient one step instead two step process.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)