You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by "Josh Elser (JIRA)" <ji...@apache.org> on 2014/12/03 06:18:12 UTC

[jira] [Resolved] (ACCUMULO-3289) BulkFileIT failed to import files

     [ https://issues.apache.org/jira/browse/ACCUMULO-3289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Elser resolved ACCUMULO-3289.
----------------------------------
    Resolution: Fixed

Haven't had this fail in the same way recently. Added some extra logic in here that will hopefully help debug future failures (if they happen).

Since there were commits for it, I'm marking it as fixed as opposed to cannot reproduce.

> BulkFileIT failed to import files
> ---------------------------------
>
>                 Key: ACCUMULO-3289
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3289
>             Project: Accumulo
>          Issue Type: Bug
>          Components: test
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>             Fix For: 1.6.2, 1.7.0
>
>         Attachments: accumulo-3289.tar.gz
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> Had a failure with BulkFileIT. Looking at the master logs, it appears like the following might have happened:
> * 2 Tservers
> * One of the tservers doesn't respond to communication
> * The master repeatedly contacts it to try to tell it to perform the bulk load
> * The tserver that isn't communicating w/ the master has no errors
> * That tserver logged an assignment that never finished
> * That tserver also got a single bulk import request and the last thing it logged WRT that bulk import was that "Assigning 1 map files to 3 tablets at ...". The 2nd tserver doesn't appear to have anything from that tserver about the import request which should have been incoming.
> * Eventually the master tried to stop that other tserver, but the test timed out in ~30s later (not sure if the tserver would've actually stopped).
> The fact that I see an incomplete assignment and inexplicable bulk load hangs gives me pause in light of ACCUMULO-3276. Will attach some logs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)