Posted to issues@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2015/10/19 19:47:05 UTC

[jira] [Updated] (HBASE-14647) Disable TestWALProcedureStoreOnHDFS#testWalRollOnLowReplication

     [ https://issues.apache.org/jira/browse/HBASE-14647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HBASE-14647:
--------------------------
    Attachment: 14647.txt

Disabled for now. Needs work.

Pushed to branch-1.2+

The push added more logging to try and help with this case:

{code}
java.lang.RuntimeException: sync aborted
	at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.pushData(WALProcedureStore.java:492)
	at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.insert(WALProcedureStore.java:335)
	at org.apache.hadoop.hbase.master.procedure.TestWALProcedureStoreOnHDFS.testWalRollOnLowReplication(TestWALProcedureStoreOnHDFS.java:201)
{code}

... looks like it failed after fewer than 50 inserts (i < 50), but I didn't have a log line to say how many.
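
For illustration only, here is a minimal sketch of the kind of logging that would answer "how many inserts got through before the abort". It is not the code in the attached patch: the class name InsertCountSketch, the helper insertUntilFailure, and the fake store that aborts on the eighth insert are all hypothetical, and the bound of 50 is only borrowed loosely from the comment above.

```java
import java.util.function.IntConsumer;

// Hypothetical sketch, not the actual HBase test or the attached patch.
final class InsertCountSketch {

    /**
     * Calls insert up to maxInserts times and returns how many calls completed.
     * If an insert throws, the count reached so far is logged before returning,
     * so the failure point is visible in the test output.
     */
    public static int insertUntilFailure(int maxInserts, IntConsumer insert) {
        int completed = 0;
        try {
            for (int i = 0; i < maxInserts; i++) {
                insert.accept(i);   // stands in for a store insert call
                completed++;
            }
        } catch (RuntimeException e) {
            System.err.println("insert failed after " + completed + " of "
                + maxInserts + " inserts: " + e.getMessage());
        }
        return completed;
    }

    public static void main(String[] args) {
        // Fake store (assumption): aborts on the eighth insert, index 7.
        int done = insertUntilFailure(50, i -> {
            if (i == 7) {
                throw new RuntimeException("sync aborted");
            }
        });
        System.out.println("completed inserts: " + done);
    }
}
```

With a counter like this in the test loop, the next failed run would at least say where in the loop the sync aborted.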

The exception is:

{code}
2015-10-19 04:32:36,051 WARN  [Thread-416] hdfs.DFSOutputStream$DataStreamer(558): DataStreamer Exception
org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /test-logs/state-00000000000000000015.log could only be replicated to 2 nodes instead of minReplication (=3).  There are 3 datanode(s) running and 3 node(s) are excluded in this operation.
...
{code}

So, somehow we are marking all our replicas as bad.

Would upping the datanodes to 6 or 10 make this test more likely to pass?

For study. Let me file an issue to re-enable.

> Disable TestWALProcedureStoreOnHDFS#testWalRollOnLowReplication
> ---------------------------------------------------------------
>
>                 Key: HBASE-14647
>                 URL: https://issues.apache.org/jira/browse/HBASE-14647
>             Project: HBase
>          Issue Type: Sub-task
>          Components: test
>            Reporter: stack
>            Assignee: stack
>             Fix For: 2.0.0, 1.2.0, 1.3.0, 1.1.3
>
>         Attachments: 14647.txt
>
>
> It failed on two trunk builds. Even after attempts at making the test looser, it still fails. Needs work. Disabling for now while trying to stabilize the build.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)