You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Robert Muir (JIRA)" <ji...@apache.org> on 2017/11/20 14:24:05 UTC

[jira] [Commented] (LUCENE-8053) Similarities should round the length up

    [ https://issues.apache.org/jira/browse/LUCENE-8053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16259283#comment-16259283 ] 

Robert Muir commented on LUCENE-8053:
-------------------------------------

frequencies can always be larger than the length because of synonym stacking (discount_overlaps) so I'm not sure its worth doing this?

> Similarities should round the length up
> ---------------------------------------
>
>                 Key: LUCENE-8053
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8053
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Adrien Grand
>            Priority: Minor
>
> The encoding that we use for lengths currently rounds down in case the length cannot be stored accurately. We should round up instead so that frequencies can never be larger than the length.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org