You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Matteo Bertozzi (JIRA)" <ji...@apache.org> on 2014/03/18 23:04:43 UTC

[jira] [Commented] (HBASE-10786) If snapshot verification fails with 'Regions moved', the message should contain the name of region causing the failure

    [ https://issues.apache.org/jira/browse/HBASE-10786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939870#comment-13939870 ] 

Matteo Bertozzi commented on HBASE-10786:
-----------------------------------------

Is this doing what you expect? looks like is showing just the first missing region and then it throws the CorruptedSnapshotException

> If snapshot verification fails with 'Regions moved', the message should contain the name of region causing the failure
> ----------------------------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-10786
>                 URL: https://issues.apache.org/jira/browse/HBASE-10786
>             Project: HBase
>          Issue Type: Task
>            Reporter: Ted Yu
>            Assignee: Ted Yu
>            Priority: Minor
>         Attachments: 10786-v1.txt
>
>
> I was trying to find cause for test failure in https://builds.apache.org/job/PreCommit-HBASE-Build/9036//testReport/org.apache.hadoop.hbase.snapshot/TestSecureExportSnapshot/testExportRetry/ :
> {code}
> org.apache.hadoop.hbase.snapshot.HBaseSnapshotException: org.apache.hadoop.hbase.snapshot.HBaseSnapshotException: Snapshot { ss=emptySnaptb0-1395177346656 table=testtb-1395177346656 type=FLUSH } had an error.  Procedure emptySnaptb0-1395177346656 { waiting=[] done=[] }
> 	at org.apache.hadoop.hbase.master.snapshot.SnapshotManager.isSnapshotDone(SnapshotManager.java:342)
> 	at org.apache.hadoop.hbase.master.HMaster.isSnapshotDone(HMaster.java:3007)
> 	at org.apache.hadoop.hbase.protobuf.generated.MasterProtos$MasterService$2.callBlockingMethod(MasterProtos.java:40494)
> 	at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2020)
> 	at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:98)
> 	at org.apache.hadoop.hbase.ipc.FifoRpcScheduler$1.run(FifoRpcScheduler.java:73)
> 	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
> 	at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:138)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
> 	at java.lang.Thread.run(Thread.java:662)
> Caused by: org.apache.hadoop.hbase.snapshot.CorruptedSnapshotException via Failed taking snapshot { ss=emptySnaptb0-1395177346656 table=testtb-1395177346656 type=FLUSH } due to exception:Regions moved during the snapshot '{ ss=emptySnaptb0-1395177346656 table=testtb-1395177346656 type=FLUSH }'. expected=9 snapshotted=8:org.apache.hadoop.hbase.snapshot.CorruptedSnapshotException: Regions moved during the snapshot '{ ss=emptySnaptb0-1395177346656 table=testtb-1395177346656 type=FLUSH }'. expected=9 snapshotted=8
> 	at org.apache.hadoop.hbase.errorhandling.ForeignExceptionDispatcher.rethrowException(ForeignExceptionDispatcher.java:83)
> 	at org.apache.hadoop.hbase.master.snapshot.TakeSnapshotHandler.rethrowExceptionIfFailed(TakeSnapshotHandler.java:320)
> 	at org.apache.hadoop.hbase.master.snapshot.SnapshotManager.isSnapshotDone(SnapshotManager.java:332)
> 	... 11 more
> {code}
> However, it is not clear which region caused the verification to fail.
> I searched for log from balancer but found none.
> The exception message should include region name which caused the verification to fail.



--
This message was sent by Atlassian JIRA
(v6.2#6252)