Posted to notifications@accumulo.apache.org by "Eric Newton (JIRA)" <ji...@apache.org> on 2013/04/02 16:43:15 UTC

[jira] [Commented] (ACCUMULO-1044) bulk imported files showing up in metadata after bulk import fails

    [ https://issues.apache.org/jira/browse/ACCUMULO-1044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13619863#comment-13619863 ] 

Eric Newton commented on ACCUMULO-1044:
---------------------------------------

This may be impossible to fix without a complete restructuring of bulk import.

There is a race condition between updating the metadata with the bulk-file-loaded flag and closing the transaction.  The current code keeps this window very small, but the race can still occur.
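To make the window concrete, here is a minimal sketch of one interleaving that would leave a metadata reference behind. It is a toy model, not Accumulo code: the Cluster class, the function names, the transaction id 7, the file name f1.rf, and the assumption that the tablet server checks the transaction before writing the flag are all hypothetical and chosen only for illustration.

```python
# Toy model of the race; every name here is hypothetical, not an Accumulo API.

class Cluster:
    def __init__(self):
        self.active_txns = {7}        # bulk-import transaction ids still open
        self.metadata = {}            # file name -> transaction id that loaded it
        self.import_dir = {"f1.rf"}   # files waiting to be bulk imported
        self.failures_dir = set()     # files moved aside after a failed import


def tserver_check(cluster, tid):
    """Tablet server, step 1: confirm the transaction is still open."""
    return tid in cluster.active_txns


def tserver_write_flag(cluster, tid, name):
    """Tablet server, step 2: record the file as loaded in the metadata."""
    cluster.metadata[name] = tid


def master_fail_import(cluster, tid):
    """Master gives up: close the transaction, move unloaded files to failures."""
    cluster.active_txns.discard(tid)
    for name in list(cluster.import_dir):
        if name not in cluster.metadata:
            cluster.import_dir.remove(name)
            cluster.failures_dir.add(name)


def dangling_references(cluster):
    """Metadata entries whose file is no longer in the import directory."""
    return sorted(n for n in cluster.metadata if n not in cluster.import_dir)


def run(interleaving):
    """Replay the steps in the given order and report dangling references."""
    cluster = Cluster()
    txn_open = False
    for step in interleaving:
        if step == "check":
            txn_open = tserver_check(cluster, 7)
        elif step == "write" and txn_open:
            tserver_write_flag(cluster, 7, "f1.rf")
        elif step == "fail":
            master_fail_import(cluster, 7)
    return dangling_references(cluster)


if __name__ == "__main__":
    print(run(["check", "write", "fail"]))   # load completes before the failure
    print(run(["fail", "check", "write"]))   # failure is seen before the load
    print(run(["check", "fail", "write"]))   # failure lands inside the window
```

In this model only the third ordering, where the master closes the transaction between the tablet server's check and its metadata write, leaves a reference to a file that has already been moved. Shrinking the gap between the two tablet-server steps makes that ordering less likely but does not rule it out.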

Another "fix" is to never move files to the failures directory: always copy them.  However, that only moves the race condition from the Master to the Garbage Collector.

The current workaround is to set the number of retries to a very high value.
                
> bulk imported files showing up in metadata after bulk import fails
> ------------------------------------------------------------------
>
>                 Key: ACCUMULO-1044
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1044
>             Project: Accumulo
>          Issue Type: Bug
>          Components: master, tserver
>    Affects Versions: 1.4.2
>            Reporter: Eric Newton
>            Assignee: Eric Newton
>            Priority: Critical
>             Fix For: 1.5.0, 1.4.4
>
>
> Bulk import fails.  The file is moved to the failures directory.
> But references in the !METADATA table remain.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira