Posted to dev@lucene.apache.org by "Erlend Garåsen (JIRA)" <ji...@apache.org> on 2013/06/21 14:16:20 UTC

[jira] [Comment Edited] (SOLR-4565) Extend NorwegianMinimalStemFilter to handle "nynorsk"

    [ https://issues.apache.org/jira/browse/SOLR-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13690217#comment-13690217 ] 

Erlend Garåsen edited comment on SOLR-4565 at 6/21/13 12:16 PM:
----------------------------------------------------------------

I think we still need the two methods, both to determine whether we should stem for each of the two variants and for readability. I will try to explain below.

For backward compatibility, only Bokmål should be used if no variant is defined. Therefore, the useBokmaal method also returns true when no variant is defined. The same applies if "no" is set as the variant: in that case both Nynorsk and Bokmål are enabled, so useBokmaal should return true as well. I encapsulated this logic in methods for readability.
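
To make the rule above concrete, here is a minimal sketch of how the two checks could behave. It is not the code from SOLR-4565.patch: the class name VariantFlags, the constructor, and the choice to reject unknown values with an exception are assumptions made only for illustration. Only the method name useBokmaal and the variant values "nb", "nn" and "no" come from this issue; useNynorsk is an assumed counterpart.

```java
import java.util.Locale;

/** Hypothetical helper; illustrates the variant rules only, not the actual patch. */
final class VariantFlags {
    private final boolean bokmaal;
    private final boolean nynorsk;

    /** @param variant "nb", "nn", "no", or null/empty when not configured */
    VariantFlags(String variant) {
        // No variant configured: fall back to Bokmål for backward compatibility.
        String v = (variant == null || variant.trim().isEmpty())
                ? "nb"
                : variant.trim().toLowerCase(Locale.ROOT);
        // Assumption: unknown values are rejected rather than silently ignored.
        if (!v.equals("nb") && !v.equals("nn") && !v.equals("no")) {
            throw new IllegalArgumentException("Unknown variant: " + variant);
        }
        // "no" enables both variants, so each flag is true for its own code or "no".
        this.bokmaal = v.equals("nb") || v.equals("no");
        this.nynorsk = v.equals("nn") || v.equals("no");
    }

    /** True for "nb", "no", and when no variant is configured. */
    boolean useBokmaal() {
        return bokmaal;
    }

    /** True for "nn" and "no". */
    boolean useNynorsk() {
        return nynorsk;
    }
}
```

With this sketch, new VariantFlags(null) and new VariantFlags("nb") enable Bokmål only, new VariantFlags("nn") enables Nynorsk only, and new VariantFlags("no") enables both.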
                
      was (Author: erlendfg):
    I think we still need the two methods in order to determine whether we should stem for the two variants respectively, and for readability of course. I will try to explain below.

For backward-compatibility, only Bokmål should be used if no variant is defined. Therefore, the useBokMaal method will also return true if there are no variants defined. The same applies if "no" is set as a variant. This means that both Nynorsk and Bokmål are set, and thus, useBokmaal should return true as well. I encapsulated this into a method for readability reasons.
                  
> Extend NorwegianMinimalStemFilter to handle "nynorsk"
> -----------------------------------------------------
>
>                 Key: SOLR-4565
>                 URL: https://issues.apache.org/jira/browse/SOLR-4565
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>            Reporter: Jan Høydahl
>         Attachments: SOLR-4565.patch
>
>
> Norway has two official languages, both called "Norwegian", namely Bokmål (nb_NO) and Nynorsk (nn_NO).
> The NorwegianMinimalStemFilter and NorwegianLightStemFilter today only work with the larger of the two, namely Bokmål.
> Propose to incorporate "nn" support through a new "variant" config option:
> * variant="nb" or not configured -> Bokmål as today
> * variant="nn" -> Nynorsk only
> * variant="no" -> Remove stems for both nb and nn
