Posted to dev@lucene.apache.org by bu...@apache.org on 2003/09/30 03:08:13 UTC

DO NOT REPLY [Bug 23505] New: - Russian Analyzer assumes default encoding is iso-8859-1

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23505>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23505

Russian Analyzer assumes default encoding is iso-8859-1

           Summary: Russian Analyzer assumes default encoding is iso-8859-1
           Product: Lucene
           Version: CVS Nightly - Specify date in submission
          Platform: Macintosh
        OS/Version: MacOS X
            Status: NEW
          Severity: Normal
          Priority: Other
         Component: Analysis
        AssignedTo: lucene-dev@jakarta.apache.org
        ReportedBy: hani@formicary.net


On OS X, the default encoding is MacRoman rather than iso-8859-1, so TestRussianAnalyzer fails 
because the test file is read with the wrong encoding.

The correct fix is to read the test file with an explicitly specified iso-8859-1 encoding 
instead of relying on the platform default. I've attached a patch.
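
For illustration only, here is a minimal sketch of the general technique, not the attached patch itself. It assumes the fix amounts to passing a charset name to InputStreamReader; the class name ExplicitEncodingExample and the helper readAll are hypothetical and do not come from the Lucene sources.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;

// Hypothetical example class; not part of Lucene or of the attached patch.
class ExplicitEncodingExample {

    // Decodes the whole stream using the named charset rather than the
    // platform default (which is MacRoman on OS X, per the report above).
    static String readAll(InputStream in, String charsetName) throws IOException {
        Reader reader = new InputStreamReader(in, charsetName);
        StringBuilder sb = new StringBuilder();
        char[] buf = new char[1024];
        int n;
        while ((n = reader.read(buf)) != -1) {
            sb.append(buf, 0, n);
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        // Byte 0xE9 is U+00E9 in iso-8859-1; other single-byte encodings
        // may map the same byte to a different character.
        byte[] data = { (byte) 0xE9 };
        String text = readAll(new ByteArrayInputStream(data), "ISO-8859-1");
        System.out.println((int) text.charAt(0)); // prints 233
    }
}
```

With the charset named explicitly, the same bytes decode to the same characters regardless of the platform's default encoding, which is what a test reading a fixed data file needs.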
