Posted to issues@phoenix.apache.org by "Abhishek Talluri (JIRA)" <ji...@apache.org> on 2018/10/18 15:06:00 UTC

[jira] [Comment Edited] (PHOENIX-4980) Mismatch in row counts between data and index tables while multiple clients try to upsert data

    [ https://issues.apache.org/jira/browse/PHOENIX-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16655367#comment-16655367 ] 

Abhishek Talluri edited comment on PHOENIX-4980 at 10/18/18 3:05 PM:
---------------------------------------------------------------------

[~vincentpoon] I have also seen a case where each CSV file had only 50k records, i.e. at most 50k distinct primary keys. I used 2 threads and kept the same PK for each record in both, yet when I run a distinct on the primary key I get more than 60k entries.


was (Author: abhishektalluri@cloudera.com):
[~vincentpoon] I have also seen a case where the CSV had only 50k records, i.e. at most 50k primary keys, yet when I run a distinct on the primary key I get more than 60k entries.

> Mismatch in row counts between data and index tables while multiple clients try to upsert data
> ----------------------------------------------------------------------------------------------
>
>                 Key: PHOENIX-4980
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-4980
>             Project: Phoenix
>          Issue Type: Bug
>    Affects Versions: 4.14.0
>            Reporter: Abhishek Talluri
>            Priority: Major
>              Labels: LocalIndex, globalMutableSecondaryIndex, secondaryIndex
>         Attachments: TestSecIndex.java
>
>
> The Phoenix table has columns A, B, C, D, and E, with A as the primary key.
> CREATE TABLE TEST (A VARCHAR NOT NULL PRIMARY KEY, B VARCHAR, C VARCHAR, D VARCHAR, E VARCHAR);
> A global index is built on D and E:
> CREATE INDEX TEST_IND ON TEST (D, E);
> Client 1 upserts A, B, C, whereas client 2 upserts A, B, D, E.
> I tested this with the phoenix 5.14.2-1.cdh5.14.2.p0.3 parcel, running two threads that load data via upserts read from a CSV file. Within 10 iterations, the row counts of the data table and the index table diverged. The code used to test this behavior is attached. The issue occurs with both global and local indexes.
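
One way to check for the mismatch described above is to compare a count that is forced to scan the data table with a count taken from the index table directly. This is only a sketch and is not taken from the attached TestSecIndex.java. It assumes the TEST and TEST_IND names from the DDL above and a global index, and it assumes both queries run while no client is upserting.

```sql
-- Count rows in the data table. The NO_INDEX hint keeps the optimizer
-- from answering this query out of TEST_IND instead of TEST.
SELECT /*+ NO_INDEX */ COUNT(*) FROM TEST;

-- Count rows in the global index table by querying it by name.
SELECT COUNT(*) FROM TEST_IND;
```

If the index is consistent with the data table, the two counts should be equal. A difference between them is the symptom reported in this issue.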



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)