You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Jarek Jarcec Cecho (JIRA)" <ji...@apache.org> on 2012/11/23 17:12:58 UTC

[jira] [Commented] (SQOOP-721) Duplicating rows on export when exporting from compressed files.

    [ https://issues.apache.org/jira/browse/SQOOP-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13503238#comment-13503238 ] 

Jarek Jarcec Cecho commented on SQOOP-721:
------------------------------------------

We're using CombineFileInputFormat implementation that was copied over from Hadoop namespace to Sqoop namespace to retain exactly the same behavior across all supported Hadoop platforms. It seems that this issue was already fixed in upstream version in MAPREDUCE-1597. I'll try to port new version to our code base.
                
> Duplicating rows on export when exporting from compressed files.
> ----------------------------------------------------------------
>
>                 Key: SQOOP-721
>                 URL: https://issues.apache.org/jira/browse/SQOOP-721
>             Project: Sqoop
>          Issue Type: Bug
>    Affects Versions: 1.4.2
>            Reporter: Jarek Jarcec Cecho
>            Assignee: Jarek Jarcec Cecho
>            Priority: Blocker
>
> It appears that in some situations export will duplicate rows. It seems that this behavior is happening when user is exporting compressed files that are "big enough".

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira