You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by "Josh Elser (JIRA)" <ji...@apache.org> on 2015/02/20 22:05:12 UTC

[jira] [Resolved] (ACCUMULO-1759) Empty walogs block recovery after power outage.

     [ https://issues.apache.org/jira/browse/ACCUMULO-1759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Elser resolved ACCUMULO-1759.
----------------------------------
    Resolution: Duplicate

Assuming it wasn't related to encryption, ACCUMULO-3182 might have fixed the issue here.

> Empty walogs block recovery after power outage.
> -----------------------------------------------
>
>                 Key: ACCUMULO-1759
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1759
>             Project: Accumulo
>          Issue Type: Bug
>    Affects Versions: 1.5.0
>         Environment: * HDP 1.3
> ** {{dfs.durable.sync=true}}
> ** {{dfs.datanode.synconclose=true}}
> * encrytion patch from ACCUMULO-998
>            Reporter: Luke Brassard
>
> Power was abruptly cut to the cluster. Upon restart of HDFS, there was a single Rfile that was missing a block. 
> Here are some details about the cluster:
> * HDP 1.3
> ** {{dfs.durable.sync=true}}
> ** {{dfs.datanode.synconclose=true}}
> * encrytion patch from ACCUMULO-998
> After restarting Accumulo, the Master was complaining with a series of these:
> {code}
> 2013-10-09 18:15:16,649 [recovery.HadoopLogCloser] INFO : Waiting for file to be closed /accumulo/wal/10.10.0.1+9997/d52ab315-5ac1-4a5c-9085-67ae29b98b88
> 2013-10-09 18:15:16,663 [recovery.HadoopLogCloser] INFO : Waiting for file to be closed /accumulo/wal/10.10.0.2+9997/d0192739-74e2-43a0-985f-3ed668259995
> 2013-10-09 18:15:16,742 [recovery.HadoopLogCloser] INFO : Waiting for file to be closed /accumulo/wal/10.10.0.3+9997/de54e6dc-964a-4b33-b4fb-052e81749913
> 2013-10-09 18:15:16,833 [recovery.HadoopLogCloser] INFO : Waiting for file to be closed /accumulo/wal/10.10.0.4+9997/cda5daec-25f3-443b-818a-990d3eddd56f
> {code}
> Inspection of the files above showed that they were all empty, but referenced in the {{!METADATA}} table. The solution was to move or remove the files from HDFS and delete the references from the metadata. The instance was then able to stabilize and assign the rest of the tablets.
> It is unclear why these empty walogs existed in the first place. Is it possible that there should have been data in these walogs? Or should the files have been disregarded since they were empty?
> Regarding the Rfile that was missing the block, it was not being referenced by the {{!METADATA}} table, so removing it had no negative effect on the instance. Should there have been some reference to this file?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)