Posted to server-dev@james.apache.org by "Benoit Tellier (Jira)" <se...@james.apache.org> on 2022/02/16 05:03:00 UTC

[jira] [Created] (JAMES-3713) DLP matcher: enable caching

Benoit Tellier created JAMES-3713:
-------------------------------------

             Summary: DLP matcher: enable caching
                 Key: JAMES-3713
                 URL: https://issues.apache.org/jira/browse/JAMES-3713
             Project: James Server
          Issue Type: Improvement
          Components: data, Mailet Contributions
    Affects Versions: master
            Reporter: Benoit Tellier


Querying DLP rules on top of Cassandra relies on event sourcing: it involves reading a potentially large event stream and performing some SERIAL reads (lightweight transactions, LWTs) for each processed email. Flame graphs taken on Cassandra tend to show that 20% of its activity consists of Paxos transactions for LWTs, and in my setup DLP accounts for ~10% of the LWTs.

Since DLP rules depend on the domain of the sender, the cardinality is extremely low.

Also, we can afford a window of inconsistency in rule application (e.g. a minute).

This makes caching very appealing...
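
To make the idea concrete, here is a minimal sketch of a time-bounded cache keyed by sender domain. It is not James code: the class name ExpiringCache, the loader function and the injectable clock are all hypothetical, and the loader stands in for whatever call currently reads the rules from the event store. An actual change would more likely rely on an existing caching library.

```java
import java.time.Duration;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;
import java.util.function.LongSupplier;

/** Hypothetical time-bounded cache (illustration only, not a James class). */
final class ExpiringCache<K, V> {
    /** A cached value together with the time at which it was loaded. */
    private static final class Entry<V> {
        final V value;
        final long loadedAtNanos;

        Entry(V value, long loadedAtNanos) {
            this.value = value;
            this.loadedAtNanos = loadedAtNanos;
        }
    }

    private final Map<K, Entry<V>> entries = new ConcurrentHashMap<>();
    private final Function<K, V> loader;   // assumed: reads the rules from the backend
    private final long ttlNanos;           // accepted inconsistency window
    private final LongSupplier nanoClock;  // injectable so the expiry can be tested

    ExpiringCache(Function<K, V> loader, Duration ttl, LongSupplier nanoClock) {
        this.loader = loader;
        this.ttlNanos = ttl.toNanos();
        this.nanoClock = nanoClock;
    }

    /** Returns the cached value, reloading it once it is older than the TTL. */
    V get(K key) {
        long now = nanoClock.getAsLong();
        Entry<V> entry = entries.compute(key, (k, current) ->
            current != null && now - current.loadedAtNanos < ttlNanos
                ? current                               // still fresh: no backend read
                : new Entry<>(loader.apply(k), now));   // missing or stale: reload
        return entry.value;
    }
}
```

With a one-minute TTL and System::nanoTime as the clock, each sender domain would trigger at most one backend read per minute, however many emails are processed for that domain. The price is that a rule change may take up to a minute to be applied.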



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
