You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Andrew Kyle Purtell (Jira)" <ji...@apache.org> on 2022/07/12 19:42:00 UTC

[jira] [Comment Edited] (HBASE-27162) TestReplicationValueCompressedWAL is flaky

    [ https://issues.apache.org/jira/browse/HBASE-27162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565988#comment-17565988 ] 

Andrew Kyle Purtell edited comment on HBASE-27162 at 7/12/22 7:41 PM:
----------------------------------------------------------------------

Would you like me to submit a PR to @Ignore this test for now? To clean up the nightlies? I do not have enough to go on right now to understand the problem. Can come back to this later and try to de-flake it.


was (Author: apurtell):
Would you like me to submit a PR to @Ignore this test for now? I do not have enough to go on right now to understand the problem. Can come back to this later and try to de-flake it.

> TestReplicationValueCompressedWAL is flaky
> ------------------------------------------
>
>                 Key: HBASE-27162
>                 URL: https://issues.apache.org/jira/browse/HBASE-27162
>             Project: HBase
>          Issue Type: Bug
>          Components: Replication, test, wal
>            Reporter: Duo Zhang
>            Priority: Major
>
> https://nightlies.apache.org/hbase/HBase-Flaky-Tests/master/3752/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.replication.regionserver.TestReplicationValueCompressedWAL-output.txt
> {noformat}
> 2022-06-26T08:41:27,359 DEBUG [RS_REFRESH_PEER-regionserver/jenkins-hbase19:0-0.replicationSource,2.replicationSource.wal-reader.jenkins-hbase19.apache.org%2C44659%2C1656232862372,2] wal.ProtobufLogReader(448): Encountered a malformed edit, seeking back to last good position in file, from 65537 to 65536
> java.io.EOFException: Partial PB while reading WAL, probably an unexpected EOF, ignoring. current offset=65537
> 	at org.apache.hadoop.hbase.regionserver.wal.ProtobufLogReader.readNext(ProtobufLogReader.java:390) ~[classes/:?]
> 	at org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:104) ~[classes/:?]
> 	at org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:92) ~[classes/:?]
> 	at org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.readNextEntryAndRecordReaderPosition(WALEntryStream.java:264) ~[classes/:?]
> 	at org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:178) ~[classes/:?]
> 	at org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:103) ~[classes/:?]
> 	at org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.readWALEntries(ReplicationSourceWALReader.java:212) ~[classes/:?]
> 	at org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:148) ~[classes/:?]
> {noformat}
> Actually, we should still have a lot of edits after this point. This could  be a critical problem as we will hang replication forever...



--
This message was sent by Atlassian Jira
(v8.20.10#820010)