You are viewing a plain text version of this content. The canonical link for it is here.
Posted to server-dev@james.apache.org by "Benoit Tellier (Jira)" <se...@james.apache.org> on 2021/08/16 02:52:00 UTC

[jira] [Closed] (JAMES-3626) Setting value of a collection inserts tomstone

     [ https://issues.apache.org/jira/browse/JAMES-3626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Benoit Tellier closed JAMES-3626.
---------------------------------
    Resolution: Fixed

> Setting value of a collection inserts tomstone
> ----------------------------------------------
>
>                 Key: JAMES-3626
>                 URL: https://issues.apache.org/jira/browse/JAMES-3626
>             Project: James Server
>          Issue Type: Improvement
>          Components: cassandra
>    Affects Versions: master
>            Reporter: Benoit Tellier
>            Priority: Major
>              Labels: cassandra, perf
>             Fix For: 3.7.0
>
>          Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> https://docs.datastax.com/en/landing_page/doc/landing_page/dataModel.html#dataModel__checkTableStructure
> {code:java}
> Check use of collection types
> When insert or full update of a non-frozen collection occurs, such as replacing the value of the column with another value like UPDATE table SET field = new_value …, Cassandra inserts a tombstone marker to prevent possible overlap with previous data even if data did not previously exist. A large number of tombstones can significantly affect read performance.
> When you know that no previous data exists and to prevent creation of tombstones when inserting data into a set or map (or when performing the full update of a set or map), you can use append operation for columns. For example:
> CREATETABLE test.m1 (
> id int PRIMARY KEY,
> m map<int, text>
> );
> instead of using:
> INSERT INTO test.m1(id, m) VALUES (1, {1:'t1', 2:'t2'}); 
> or
> UPDATE test.m1 SET m = {1:'t1', 2:'t2'} WHERE id = 1; 
> which generate tombstones, execute:
> UPDATE test.m1 SET m = m + {1:'t1', 2:'t2'} WHERE id = 1; 
> which has the same result, but without tombstone generation.
> {code}
> Affected tables:
>  - imapUidTable (use of a non frozen set for user flags)
>       - inserts are done
>       - full updates are done
>       - This table is critical for JMAP performance. Not that it can explain the ~10% performance hit that occurs when reading mailbox with recent inserts versus reading mailboxes without recent inserts.
>  - messageIdTable (use of a non frozen set for user flags)
>       - inserts are done
>       - full updates are done
>       - This table is critical for IMAP performance.
>  - enqueuedMailsV3 (non frozen list data types are mistakenly used for RECIPIENTS and PER_RECIPIENT_SPECIFIC_HEADERS - frozen would be acceptable - map is used for attributed and could be frozen too)
>       - As a temporary measure, we can fix the inserts.
>       - This table is critical to the browse start update performance
>  - mailRepositoryContentV2 (non frozen list data types are mistakenly used for RECIPIENTS and PER_RECIPIENT_SPECIFIC_HEADERS - frozen would be acceptable- map is used for attributed and could be frozen too)
>       - As a temporary measure, we can fix the inserts.
>       - This table is not critical



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: server-dev-unsubscribe@james.apache.org
For additional commands, e-mail: server-dev-help@james.apache.org