You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by bu...@apache.org on 2003/09/30 03:08:13 UTC
DO NOT REPLY [Bug 23505] New: -
Russian Analyzer assumes default encoding is iso-8859-1
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23505>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND
INSERTED IN THE BUG DATABASE.
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23505
Russian Analyzer assumes default encoding is iso-8859-1
Summary: Russian Analyzer assumes default encoding is iso-8859-1
Product: Lucene
Version: CVS Nightly - Specify date in submission
Platform: Macintosh
OS/Version: MacOS X
Status: NEW
Severity: Normal
Priority: Other
Component: Analysis
AssignedTo: lucene-dev@jakarta.apache.org
ReportedBy: hani@formicary.net
On OSX, the default encoding is MacRoman, so this causes TestRussianAnalyzer to fail, since the
file is not read in correctly.
The correct solution is to explicitly specify that the test file should be read using iso-8859-1
encoding. I've attached a patch.
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org