You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Davide Giannella (JIRA)" <ji...@apache.org> on 2019/01/12 00:54:00 UTC

[jira] [Updated] (OAK-7961) LSH should truncate to first decimal place

     [ https://issues.apache.org/jira/browse/OAK-7961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Davide Giannella updated OAK-7961:
----------------------------------
    Fix Version/s: 1.11.0

> LSH should truncate to first decimal place
> ------------------------------------------
>
>                 Key: OAK-7961
>                 URL: https://issues.apache.org/jira/browse/OAK-7961
>             Project: Jackrabbit Oak
>          Issue Type: Task
>          Components: lucene
>            Reporter: Tommaso Teofili
>            Assignee: Tommaso Teofili
>            Priority: Major
>             Fix For: 1.9.14, 1.11.0
>
>
> Feature vector shingles currently contain three decimal places, e.g. _[0.101, 0.202, 0.303]_.
> Since min hash is not aware of the fact that second and third decimal place values are less important than first place values, it is likely that this will impact the search precision.
> We should be truncating to the first decimal place.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)