You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ponymail.apache.org by sebbASF <gi...@git.apache.org> on 2017/01/07 22:39:41 UTC

[GitHub] incubator-ponymail issue #329: Bug: 'hot topics' feature should use terms, n...

GitHub user sebbASF opened an issue:

    https://github.com/apache/incubator-ponymail/issues/329

    Bug: 'hot topics' feature should use terms, not significant_terms

    The 'word cloud' option turns on the 'hot topics' under the statistics panel.
    
    The topics are currently generated using the ES 'significant_terms' aggregation.
    
    If the intention is to show the hot topics for the selected messages, then I think this is the wrong aggregation. 
    
    Significant_terms shows terms in the selection which occur more frequently relative to the average in the background - which in this case is the mbox index. I think it would make more sense to show the busiest terms for the selection only.
    
    This would have the advantage that the aggregation will work on ES 5.x without needing to enable fielddata on the subject field.
    
    The aggregation can also be filtered by providing exclusions, e.g. the following will exclude words or 3 letters or fewer:
    
    "exclude": ".|..|..."
    
    The exclusion list could be defined as a config item, with a suitable default (e.g. as above)

----

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

[GitHub] incubator-ponymail issue #329: Bug: 'hot topics' feature should use terms, n...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the issue at:

    https://github.com/apache/incubator-ponymail/issues/329


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

[GitHub] incubator-ponymail issue #329: Bug: 'hot topics' feature should use terms, n...

Posted by sebbASF <gi...@git.apache.org>.
Github user sebbASF commented on the issue:

    https://github.com/apache/incubator-ponymail/issues/329
  
    In fact the 'terms' aggregation also does not work on ES 5.x for text fields.
    (The other aggregations are all done on keyword fields)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---