Posted to dev@sqoop.apache.org by "Joey Echeverria (JIRA)" <ji...@apache.org> on 2011/08/19 20:02:31 UTC

[jira] [Updated] (SQOOP-319) The --hive-drop-import-delims option should accept a replacement string

     [ https://issues.apache.org/jira/browse/SQOOP-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joey Echeverria updated SQOOP-319:
----------------------------------

    Attachment: SQOOP-319-1.patch

I added a new option, --hive-delims-replacement, which takes a replacement string. I made it a separate option, rather than changing --hive-drop-import-delims, so the existing interface stays backwards compatible. I also added a test for the new option.

> The --hive-drop-import-delims option should accept a replacement string
> -----------------------------------------------------------------------
>
>                 Key: SQOOP-319
>                 URL: https://issues.apache.org/jira/browse/SQOOP-319
>             Project: Sqoop
>          Issue Type: Bug
>          Components: hive-integration
>    Affects Versions: 1.3.0
>            Reporter: Joey Echeverria
>            Priority: Minor
>         Attachments: SQOOP-319-1.patch
>
>
> When importing data into Hive, you have the option of dropping the Hive delimiters from data fields. It would be more useful to replace the delimiters with a user-defined string. The dropped delimiters (like \n) often separate words, so if I split on whitespace in my Hive queries, I now get two words merged together. A more desirable behavior would be to replace each delimiter with a space. Making the replacement user-configurable gives the most flexibility.
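
To illustrate the difference described above, here is a minimal Java sketch of dropping versus replacing delimiters in a string field. It assumes the delimiters in question are \n, \r, and \01 (Hive's default record and field delimiters). The class and method names are hypothetical and are not taken from the attached patch.

```java
import java.util.regex.Matcher;

public class HiveDelimsSketch {
    // Assumed delimiter set: newline, carriage return, and Ctrl-A (octal 001).
    private static final String HIVE_DELIMS = "[\n\r\001]";

    // Current behavior: delimiters are removed, so neighboring words can merge.
    public static String dropDelims(String field) {
        return replaceDelims(field, "");
    }

    // Requested behavior: each delimiter is swapped for a caller-supplied string.
    public static String replaceDelims(String field, String replacement) {
        if (field == null) {
            return null;
        }
        // quoteReplacement keeps '$' and '\' in the replacement literal.
        return field.replaceAll(HIVE_DELIMS, Matcher.quoteReplacement(replacement));
    }

    public static void main(String[] args) {
        String field = "first line\nsecond line";
        System.out.println(dropDelims(field));         // prints "first linesecond line"
        System.out.println(replaceDelims(field, " ")); // prints "first line second line"
    }
}
```

With dropping, "line" and "second" fuse into one token, which is the problem for queries that split on whitespace. Replacing with a space keeps them apart.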

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira