You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Luca Cavanna (JIRA)" <ji...@apache.org> on 2012/04/24 14:51:35 UTC

[jira] [Commented] (LUCENE-3976) Improve error messages for unsupported Hunspell formats

    [ https://issues.apache.org/jira/browse/LUCENE-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13260490#comment-13260490 ] 

Luca Cavanna commented on LUCENE-3976:
--------------------------------------

We found out that some recent dutch dictionaries contain rule like the one mentioned (Starting from version 2.00 if I'm correct). I'm going to look at that specific problem and see how we can parse those affix rules.
                
> Improve error messages for unsupported Hunspell formats
> -------------------------------------------------------
>
>                 Key: LUCENE-3976
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3976
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>            Reporter: Chris Male
>         Attachments: LUCENE-3976.patch
>
>
> Our hunspell implementation is never going to be able to support the huge variety of formats that are out there, especially since our impl is based on papers written on the topic rather than being a pure port.
> Recently we ran into the following suffix rule:
> {noformat}SFX CA 0 /CaCp{noformat}
> Due to the missing regex conditional, an AOE was being thrown, which made it difficult to diagnose the problem.
> We should instead try to provide better error messages showing what we were unable to parse.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org