You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Jarek Jarcec Cecho (JIRA)" <ji...@apache.org> on 2012/11/23 17:20:58 UTC

[jira] [Comment Edited] (SQOOP-721) Duplicating rows on export when exporting from compressed files.

    [ https://issues.apache.org/jira/browse/SQOOP-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13503238#comment-13503238 ] 

Jarek Jarcec Cecho edited comment on SQOOP-721 at 11/23/12 4:20 PM:
--------------------------------------------------------------------

We're using CombineFileInputFormat implementation that was copied over from Hadoop namespace to Sqoop namespace to retain exactly the same behavior across all supported Hadoop platforms. It seems that this issue was already fixed upstream in MAPREDUCE-1597. I'll try to port new version to our code base.
                
      was (Author: jarcec):
    We're using CombineFileInputFormat implementation that was copied over from Hadoop namespace to Sqoop namespace to retain exactly the same behavior across all supported Hadoop platforms. It seems that this issue was already fixed in upstream version in MAPREDUCE-1597. I'll try to port new version to our code base.
                  
> Duplicating rows on export when exporting from compressed files.
> ----------------------------------------------------------------
>
>                 Key: SQOOP-721
>                 URL: https://issues.apache.org/jira/browse/SQOOP-721
>             Project: Sqoop
>          Issue Type: Bug
>    Affects Versions: 1.4.2
>            Reporter: Jarek Jarcec Cecho
>            Assignee: Jarek Jarcec Cecho
>            Priority: Blocker
>
> It appears that in some situations export will duplicate rows. It seems that this behavior is happening when user is exporting compressed files that are "big enough".

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira