Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/07/07 00:24:32 UTC

[Nutch Wiki] Trivial Update of "GettingNutchRunningWithUtf8" by RenaudRichardet

The following page has been changed by RenaudRichardet:
http://wiki.apache.org/nutch/GettingNutchRunningWithUtf8

------------------------------------------------------------------------------
  = How to Configure App Servers to Pass non-ASCII Characters? =
  Nutch GUI uses the GET method to pass the query strings to the server.  Tomcat 4 and 5 need to be configured to enable passing of non-ASCII characters.
  
- Note that this note describes how to make Tomcat pass non-ASCII characters.  Nutch, in its "factory set" configuration, handle only limited characters.  Especially, it will not handle Chinese/Japanese/Korean text properly.  (Each CJK character is treated as if it were a word by itself.)
+ Note that this note describes how to make Tomcat pass non-ASCII characters.  Nutch, in its "factory set" configuration, handle only limited characters.  Especially, it will not handle Chinese/Japanese/Korean text properly.  (Each CJK character is treated as if it were a word by itself.) German special chars are also wrongly displayed (ö, ä, ü).
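
To illustrate why the server-side setting matters, here is a minimal sketch of what happens to a non-ASCII query term sent with GET. The class and method names (`QueryEncodingSketch`, `encodeUtf8`, `decode`) are hypothetical and are not part of Nutch or Tomcat. The sketch assumes the browser percent-encodes the query as UTF-8 and that an unconfigured container decodes the URI as ISO-8859-1.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.net.URLEncoder;

// Hypothetical illustration only; not code from Nutch or Tomcat.
public class QueryEncodingSketch {

    // Percent-encode a query term as UTF-8, as a browser would for a GET request.
    static String encodeUtf8(String query) throws UnsupportedEncodingException {
        return URLEncoder.encode(query, "UTF-8");
    }

    // Decode a percent-encoded term using the given charset,
    // standing in for what the server does with the request URI.
    static String decode(String encoded, String charset) throws UnsupportedEncodingException {
        return URLDecoder.decode(encoded, charset);
    }

    public static void main(String[] args) throws Exception {
        String query = "\u00f6l"; // the German word "öl", written with an escape

        String encoded = encodeUtf8(query);
        System.out.println(encoded); // %C3%B6l

        // Decoding with the same charset restores the original term.
        System.out.println(decode(encoded, "UTF-8").equals(query)); // true

        // Decoding with ISO-8859-1 turns the two UTF-8 bytes into two characters,
        // so the search term no longer matches what the user typed.
        System.out.println(decode(encoded, "ISO-8859-1").equals(query)); // false
    }
}
```

The mismatch in the last line is the kind of garbling that the Tomcat configuration is meant to prevent.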
  
  
  == Tomcat 4 and Tomcat 5 ==