Posted to dev@lucene.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2014/03/08 18:08:43 UTC

[jira] [Commented] (LUCENE-5505) hunspell SET/FLAG whitespace/BOM handling

    [ https://issues.apache.org/jira/browse/LUCENE-5505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13924928#comment-13924928 ] 

ASF subversion and git services commented on LUCENE-5505:
---------------------------------------------------------

Commit 1575568 from [~rcmuir] in branch 'dev/trunk'
[ https://svn.apache.org/r1575568 ]

LUCENE-5505: hunspell SET/FLAG whitespace and BOM handling

> hunspell SET/FLAG whitespace/BOM handling
> -----------------------------------------
>
>                 Key: LUCENE-5505
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5505
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Robert Muir
>         Attachments: LUCENE-5505.patch
>
>
> Several dictionaries (Armenian, Papiamento, Macedonian, Russian, Urdu) cannot be loaded today because they contain lines such as SET<tab>UTF-8 or FLAG<space><space>UTF-8, or have a BOM on the first line (or even a combination of these).
> Also, because SET need not be the first line in the file, we should ignore a BOM on the first line in general (e.g. the first line might be something else, like FLAG).
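
For illustration only, here is a minimal Java sketch of the tolerant parsing the issue describes. It is not the code from the attached patch or from commit 1575568; the class and method names (AffixDirectiveSketch, stripBom, directiveValue, findDirective) are hypothetical, and it assumes the affix file has already been split into lines decoded either as UTF-8 or as ISO-8859-1.

```java
// Hypothetical sketch, not Lucene's implementation.
final class AffixDirectiveSketch {

    /**
     * Strips a leading UTF-8 BOM from a line. The BOM shows up as the single
     * character U+FEFF when the file was decoded as UTF-8, or as the three
     * characters EF BB BF when it was decoded as ISO-8859-1 (assumption: one
     * of these two decodings was used).
     */
    static String stripBom(String line) {
        if (line.startsWith("\uFEFF")) {
            return line.substring(1);
        }
        if (line.startsWith("\u00EF\u00BB\u00BF")) {
            return line.substring(3);
        }
        return line;
    }

    /**
     * Returns the value of the given directive (e.g. SET or FLAG) if this
     * line declares it, or null otherwise. Any run of spaces or tabs is
     * accepted as the separator.
     */
    static String directiveValue(String line, String directive) {
        String[] parts = line.trim().split("\\s+");
        if (parts.length >= 2 && parts[0].equals(directive)) {
            return parts[1];
        }
        return null;
    }

    /**
     * Scans every line for the directive, since it need not be on the first
     * line. The BOM is stripped from the first line only, whatever that line
     * contains.
     */
    static String findDirective(String[] lines, String directive) {
        for (int i = 0; i < lines.length; i++) {
            String line = (i == 0) ? stripBom(lines[i]) : lines[i];
            String value = directiveValue(line, directive);
            if (value != null) {
                return value;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        // First line carries a BOM and is FLAG, not SET; SET uses a tab.
        String[] lines = { "\uFEFFFLAG  UTF-8", "SET\tUTF-8" };
        System.out.println(findDirective(lines, "FLAG"));
        System.out.println(findDirective(lines, "SET"));
    }
}
```

The BOM is removed before any directive matching so that a first line such as FLAG is still recognized, and splitting on a whitespace run covers both the tab and the double-space cases mentioned above.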


