You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@phoenix.apache.org by "Karan Mehta (JIRA)" <ji...@apache.org> on 2018/11/02 21:20:00 UTC

[jira] [Updated] (PHOENIX-4997) Phoenix MR on snapshots can produce duplicate rows

     [ https://issues.apache.org/jira/browse/PHOENIX-4997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Karan Mehta updated PHOENIX-4997:
---------------------------------
    Attachment: PHOENIX-4997.4.x-HBase-1.4.001.patch

> Phoenix MR on snapshots can produce duplicate rows
> --------------------------------------------------
>
>                 Key: PHOENIX-4997
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-4997
>             Project: Phoenix
>          Issue Type: Bug
>            Reporter: Karan Mehta
>            Assignee: Karan Mehta
>            Priority: Major
>             Fix For: 4.15.0, 5.1.0
>
>         Attachments: PHOENIX-4997.4.x-HBase-1.4.001.patch, PHOENIX-4997.master.001.patch, PHOENIX-4997.master.002.patch
>
>
> Phoenix MR over snapshots uses TableSnapshotResultIterator and SnapshotScanner classes for iterating/scanning over snapshots. They had been copied over from HBase classes TableSnapshotScanner and ClientSideRegionScanner classes and modified according to Phoenix requirements. This decision was taken since some of fields of these classes were private and hence it is not possible to reuse them. HBASE-8369 is the main Jira.
> The framework had a bug which was fixed as part of HBASE-16011. However the fix was not ported to Phoenix and hence Phoenix MR over snapshots still continues to have it. This Jira is to fix that issue.
> FYI [~akshita.malhotra] 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)