You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@solr.apache.org by "Jan Høydahl (Jira)" <ji...@apache.org> on 2021/08/19 09:04:00 UTC

[jira] [Commented] (SOLR-14801) Multiple Language Detection is not reflecting properly with apache Tika/Solr Jar ()

    [ https://issues.apache.org/jira/browse/SOLR-14801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17401556#comment-17401556 ] 

Jan Høydahl commented on SOLR-14801:
------------------------------------

[~navoditbansod]  Please provide more details on how to reproduce this bug. Do you have a test document we can use as well as a configuration?

> Multiple Language Detection is not reflecting properly with apache Tika/Solr Jar ()
> -----------------------------------------------------------------------------------
>
>                 Key: SOLR-14801
>                 URL: https://issues.apache.org/jira/browse/SOLR-14801
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - LangId
>            Reporter: Navodit Bansod
>            Priority: Major
>
> Hi Team,
> Please find  the following issues occurring in case of multiple lang detection in apache Solr :
>  # Primary and Secondary language is not getting detected using separate fields/attributes for each. The language is getting generalized with the language having major chunk of data and thus reflect as same is both fields - "lang and langs" (attribute primary and secondary language)
>  # The Distance(or length) setting parameter in solrconfig.xml is properly SET in our cluster but still it seems this parameter is not showing any difference with change of values. (
> <str name="langid.threshold">0.2</str>)
>  # Following Versions are being used in our solr cloud setup: 
>  # tika-core-1.24.1.jar
>  # tika-parsers-1.24.1.jar
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@solr.apache.org
For additional commands, e-mail: issues-help@solr.apache.org