You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Philipp Chudinov <mo...@basko.ru> on 2001/11/28 23:33:36 UTC

should non-English docs be indexed as UTF-8 encoded?

Hi!
I have some xml documents (encoded as windows-1251(russian)). Should they be
converted to UTF-8 encoded documents to be indexed? Or I can index them in
windows-1251 and just encode query (search) string to UTF-8?


--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>