You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by "Eric Newton (JIRA)" <ji...@apache.org> on 2012/12/12 16:31:22 UTC

[jira] [Commented] (ACCUMULO-408) Large bulk imports timing out

    [ https://issues.apache.org/jira/browse/ACCUMULO-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13530030#comment-13530030 ] 

Eric Newton commented on ACCUMULO-408:
--------------------------------------

On large clusters, I'm seeing bulk imports fail, so the master moves the files, and yet there are entries in the METADATA table for those files.  Later, tservers complain that the files do not exist.

                
> Large bulk imports timing out
> -----------------------------
>
>                 Key: ACCUMULO-408
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-408
>             Project: Accumulo
>          Issue Type: Bug
>          Components: master, tserver
>    Affects Versions: 1.4.0
>            Reporter: Keith Turner
>            Assignee: Eric Newton
>            Priority: Critical
>             Fix For: 1.5.0
>
>
> Large bulk imports that take more than 2 minutes to process are failing.  The master ask a tablet sever to bulk import the files and times out after 2 minutes.  It retries, but the retry will take over 2 minutes.  Eventually it gives up and starts coping all of the data to the fail dir which takes forever.  This copy is all through the master and for large data basically never completes.  The bulk import threads on the tablet server probably complete eventually, so the data is still there.
> The master should not time these request out.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira