You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Servico Creator E106798-2 <cr...@telefonica.com.br> on 2005/08/31 22:01:03 UTC

searching Accented characters

    Hi,

   Is there any way to do Nutch (0.7) find accented characters (special 
characters from Brazilian Portuguese language) when I do a search without 
put the accent in the word ?

  Thanks in advance,

  Sergio Stateri Jr.
  Sao Paulo - Brazil

Re: searching Accented characters

Posted by Ken Krugler <kk...@transpac.com>.
>    Is there any way to do Nutch (0.7) find accented characters (special
>characters from Brazilian Portuguese language) when I do a search without
>put the accent in the word ?

Yes, if the parse-html code strips diacriticals. You'd also want to 
make sure your query parser does the same thing for the user's search 
terms.

-- Ken
-- 
Ken Krugler
TransPac Software, Inc.
<http://www.transpac.com>
+1 530-470-9200