You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Yu Li (JIRA)" <ji...@apache.org> on 2018/07/16 06:01:00 UTC

[jira] [Resolved] (HBASE-20844) Duplicate rows returned while hbase snapshot reads

     [ https://issues.apache.org/jira/browse/HBASE-20844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yu Li resolved HBASE-20844.
---------------------------
    Resolution: Duplicate

The issue is fixed by HBASE-16011 and included in release 1.3.2

> Duplicate rows returned while hbase snapshot reads
> --------------------------------------------------
>
>                 Key: HBASE-20844
>                 URL: https://issues.apache.org/jira/browse/HBASE-20844
>             Project: HBase
>          Issue Type: Bug
>          Components: mapreduce, snapshots, spark
>    Affects Versions: 1.3.1
>         Environment: Cluster Details 
> Java 	1.7
> Hbase     1.3.1
> Spark      1.6.1
>            Reporter: ShivaKumar SS
>            Priority: Major
>
> We are trying to take snapshot from code and read data using MR and spark, both approaches are returning duplicate records.
> On the API side, \{{org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat }} is used.
> Snapshot was taken during the table was in a region split state.
> We suspect it is due to data is being returned for both parent and daughter regions.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)