You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Ashu Pachauri (JIRA)" <ji...@apache.org> on 2016/11/07 20:45:58 UTC

[jira] [Created] (HBASE-17041) VerifyReplication reporting false inconsistencies

Ashu Pachauri created HBASE-17041:
-------------------------------------

             Summary: VerifyReplication reporting false inconsistencies
                 Key: HBASE-17041
                 URL: https://issues.apache.org/jira/browse/HBASE-17041
             Project: HBase
          Issue Type: Bug
          Components: Replication, Scanners
    Affects Versions: 1.3.0
            Reporter: Ashu Pachauri
            Priority: Critical


VerifyReplication often report inconsistencies that are not there. The reported inconsistencies span all types i.e. CONTENT_DIFFERENT_ROWS, ONLY_IN_SOURCE_TABLE_ROWS, ONLY_IN_PEER_TABLE_ROWS.  I tried an approach similar to HBASE-16423 to re-compare using a Get on the offending row, and the inconsistencies turn out to be false. 
This may suggest an issue with the VerifyReplication setup (unlikely, as and immediate Get based comparison succeeds), or scanners losing data.

The configurations that I tried were (modified VerifyReplication to take a row range as input):
1. Column families with only one max version with KEEP_DELETED_CELLS set to true and purge time to 7 days. VerifyReplication run was on a small time range two days ago. Multiple regions: reports inconsistencies, one region: reports fine (could just be due to smaller volume of data).
2. Column families with Integer.MAX_VALUE number of max versions, KEEP_DELETED_CELLS set to true, purge time set to 7 days and verification timerange two days ago. Both one region and multiple region setups report inconsistencies.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)