You are viewing a plain text version of this content. The canonical link for it is here.
Posted to server-dev@james.apache.org by "Tellier Benoit (JIRA)" <ji...@apache.org> on 2019/03/25 09:59:00 UTC
[jira] [Commented] (MAILBOX-314) Search should not split email into
more than one token
[ https://issues.apache.org/jira/browse/MAILBOX-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16800534#comment-16800534 ]
Tellier Benoit commented on MAILBOX-314:
----------------------------------------
4bf777b1e8 solved this a long time ago
> Search should not split email into more than one token
> ------------------------------------------------------
>
> Key: MAILBOX-314
> URL: https://issues.apache.org/jira/browse/MAILBOX-314
> Project: James Mailbox
> Issue Type: Bug
> Components: elasticsearch
> Affects Versions: master
> Reporter: Luc DUZAN
> Priority: Major
>
> We have a tokenizing issue when using ElasticSearch.
> I wrote a mail to any@domain.com will be tokenized as:
> "I" "wrote" "a" "mail" "to" "any" "domain" "com"
> Thus if I look for "other@domain.com" it will be matching!
> We should have a more powerfull mail tokenization:
> - I should be able to look for a local part OR domain part OR full email.
> ElasticSearch brings some tokenizer for emails. We need to combine it with standard analyser.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: server-dev-unsubscribe@james.apache.org
For additional commands, e-mail: server-dev-help@james.apache.org