You are viewing a plain text version of this content. The canonical link for it is here.
Posted to server-dev@james.apache.org by "Tellier Benoit (JIRA)" <ji...@apache.org> on 2019/03/25 09:59:00 UTC

[jira] [Commented] (MAILBOX-314) Search should not split email into more than one token

    [ https://issues.apache.org/jira/browse/MAILBOX-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16800534#comment-16800534 ] 

Tellier Benoit commented on MAILBOX-314:
----------------------------------------

4bf777b1e8 solved this a long time ago

> Search should not split email into more than one token
> ------------------------------------------------------
>
>                 Key: MAILBOX-314
>                 URL: https://issues.apache.org/jira/browse/MAILBOX-314
>             Project: James Mailbox
>          Issue Type: Bug
>          Components: elasticsearch
>    Affects Versions: master
>            Reporter: Luc DUZAN
>            Priority: Major
>
> We have a tokenizing issue when using ElasticSearch. 
> I wrote a mail to any@domain.com will be tokenized as:
> "I" "wrote" "a" "mail" "to" "any" "domain" "com"
> Thus if I look for "other@domain.com" it will be matching!
> We should have a more powerfull mail tokenization:
>  - I should be able to look for a local part OR domain part OR full email.
> ElasticSearch brings some tokenizer for emails. We need to combine it with standard analyser.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: server-dev-unsubscribe@james.apache.org
For additional commands, e-mail: server-dev-help@james.apache.org