Posted to notifications@accumulo.apache.org by "Eric Newton (JIRA)" <ji...@apache.org> on 2015/09/16 17:40:45 UTC

[jira] [Created] (ACCUMULO-4000) log recovery failed after hard reset

Eric Newton created ACCUMULO-4000:
-------------------------------------

             Summary: log recovery failed after hard reset
                 Key: ACCUMULO-4000
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-4000
             Project: Accumulo
          Issue Type: Bug
         Environment: very large cluster, accumulo 1.6.2, hadoop 2.5.0 (cdh 5.3)
            Reporter: Eric Newton
            Assignee: Eric Newton


Had a hardware failure on a single node within a large cluster.  Tablets were migrated away, but one tablet would not recover.  The Closer run by the master to release the write lease on the WAL failed repeatedly.

Afterwards, it was determined that the WAL file was small, probably just opened and in use at the moment the machine failed.  The block could not be recovered from any replica.

One question raised: does the write pipeline acknowledge the sync before the write pipeline completes?
