You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@atlas.apache.org by "Bolke de Bruin (Jira)" <ji...@apache.org> on 2019/08/22 04:30:00 UTC

[jira] [Comment Edited] (ATLAS-3370) Aggregation Metrics with quick search, Counts don't add up

    [ https://issues.apache.org/jira/browse/ATLAS-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16912945#comment-16912945 ] 

Bolke de Bruin edited comment on ATLAS-3370 at 8/22/19 4:29 AM:
----------------------------------------------------------------

Why is the indexing changed from fulltext to string based? this changes search behavior. per documentation of janus:

When a string mapping is configured, the string value is indexed and can be queried "as-is" - including stop words and non-letter characters. However, in this case the query must match the entire string value. Hence, the string mapping is useful when indexing short character sequences that are considered to be one token.

For the (dsl) search it might be that due to this change the performance suffers as the different indices make use of different predicates. This can be a 100x perf hit, as we haveseen in our system on the linked issue.


was (Author: bolke):
why is the indexing changed from fulltext to string based? this changes search behavior. per documentation of janus:

When a string mapping is configured, the string value is indexed and can be queried "as-is" - including stop words and non-letter characters. However, in this case the query must match the entire string value. Hence, the string mapping is useful when indexing short character sequences that are considered to be one token.

> Aggregation Metrics with quick search, Counts don't add up
> ----------------------------------------------------------
>
>                 Key: ATLAS-3370
>                 URL: https://issues.apache.org/jira/browse/ATLAS-3370
>             Project: Atlas
>          Issue Type: Bug
>            Reporter: Sridhar
>            Assignee: Sridhar
>            Priority: Major
>
> The issue was happening because of tokenization done for the fields in issue.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)