You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Robert Muir (JIRA)" <ji...@apache.org> on 2014/03/08 18:10:43 UTC

[jira] [Resolved] (LUCENE-5505) hunspell SET/FLAG whitespace/BOM handling

     [ https://issues.apache.org/jira/browse/LUCENE-5505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir resolved LUCENE-5505.
---------------------------------

       Resolution: Fixed
    Fix Version/s: 5.0
                   4.8

> hunspell SET/FLAG whitespace/BOM handling
> -----------------------------------------
>
>                 Key: LUCENE-5505
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5505
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Robert Muir
>             Fix For: 4.8, 5.0
>
>         Attachments: LUCENE-5505.patch
>
>
> Several dictionaries cannot be loaded today (Armenian, Papiamento, Macedonian, Russian, Urdu) because they have stuff like SET<tab>UTF-8, FLAG<space><space>UTF-8 or have a BOM marker on the first line (or even combinations of these).
> Also because SET need not be the first line in the file, we should ignore BOM markers on the first line in general (e.g. it might be something else like FLAG).



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org