Posted to dev@sqoop.apache.org by "Jarek Jarcec Cecho (JIRA)" <ji...@apache.org> on 2013/07/30 05:15:50 UTC

[jira] [Updated] (SQOOP-1154) Sqoop2: Text partitioner might miss or include edge values

     [ https://issues.apache.org/jira/browse/SQOOP-1154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jarek Jarcec Cecho updated SQOOP-1154:
--------------------------------------

    Attachment: import_test_case.patch
    
> Sqoop2: Text partitioner might miss or include edge values
> ----------------------------------------------------------
>
>                 Key: SQOOP-1154
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1154
>             Project: Sqoop
>          Issue Type: Bug
>    Affects Versions: 1.99.2
>            Reporter: Jarek Jarcec Cecho
>             Fix For: 1.99.3
>
>         Attachments: import_test_case.patch
>
>
> Attached is a test case that uses the Varchar partitioner to generate 3, 5, 10 and 13 partitions for the interval "Breezy Badger" to "Warty Warthog". In all cases the generated partitions look like the following:
> {code}
> 'Bree' <= VCCOL AND VCCOL < SOME_VALUE
> SOME_VALUE <= VCCOL AND VCCOL <= 'Wart'
> {code}
> As {{'Warty Warthog' > 'Wart'}}, the last value will never be imported. Similarly, as {{'Bree' < 'Breezy Badger'}}, additional values outside the interval might be imported as well (for example 'Breedy Budget'). I think the varchar partitioner must use the interval boundaries without any truncation, for example:
> {code}
> 'Breezy Badger' <= VCCOL AND VCCOL < SOME_VALUE
> SOME_VALUE <= VCCOL AND VCCOL <= 'Warty Warthog'
> {code}
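
The effect of the truncated boundaries can be checked with a small standalone sketch. It is not Sqoop's partitioner code: the class EdgePredicateCheck and the method inRange are hypothetical names, and Java's String.compareTo (plain lexicographic order) is assumed to stand in for the database's collation, which may order strings differently. Because the first and last partitions together cover the closed interval from the lower to the upper boundary, testing that interval is enough to see which values are imported.

```java
// Hypothetical sketch, not taken from the Sqoop code base.
// Assumes plain lexicographic ordering (String.compareTo); a real
// database collation may compare strings differently.
final class EdgePredicateCheck {

    // Mirrors the overall range covered by the generated partitions:
    // lo <= VCCOL AND VCCOL <= hi
    static boolean inRange(String value, String lo, String hi) {
        return lo.compareTo(value) <= 0 && value.compareTo(hi) <= 0;
    }

    public static void main(String[] args) {
        String[] values = {"Breedy Budget", "Breezy Badger", "Warty Warthog"};
        for (String v : values) {
            // Truncated boundaries, as currently generated.
            boolean truncated = inRange(v, "Bree", "Wart");
            // Untruncated boundaries, as proposed in the issue.
            boolean full = inRange(v, "Breezy Badger", "Warty Warthog");
            System.out.println(v + ": truncated=" + truncated + ", full=" + full);
        }
    }
}
```

Under these assumptions the truncated range rejects 'Warty Warthog' and accepts 'Breedy Budget', while the untruncated range does the opposite and keeps both real boundary values.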
