Posted to user@nutch.apache.org by Jérôme Charron <je...@gmail.com> on 2006/12/13 23:01:07 UTC
Re: NUTCH 0.8.1: Difficulties with Analyzers
> org.apache.nutch.searcher.NutchBean query: fr?quentes
François, two basic points I would like to check first:
1. Were the documents indexed with the French analyzer activated?
2. Could you perform a search with an unaccented query?
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/
Ref.: Re: NUTCH 0.8.1: Difficulties with Analyzers
Posted by Fr...@bnc.ca.
Hello Jérôme, thank you very much for getting back to me.
Here are the answers:
>1. Were the documents indexed with the French analyzer activated?
Yes, my hadoop-site.xml (in /opt/nutch-0.8/conf/) contained the
plugin.includes value shown below before I crawled/indexed the site.
The file nutch-site.xml within the webapp's WEB-INF/classes/
folder contains the same plugin.includes value:
nutch-extensionpoints|language-identifier|lib-lucene-analyzers|protocol-httpclient|urlfilter-regex|parse-(text|pdf|msword|html)|index-basic|analysis-fr|query-(basic|site|url)|summary-basic|scoring-opic
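For reference, a value like this is normally set as a property inside the <configuration> element of a Nutch configuration file. The fragment below is only a sketch of that layout using the value quoted above; the rest of the poster's file is not shown in this thread, so everything surrounding the property is an assumption.

```xml
<?xml version="1.0"?>
<!-- Sketch only: the surrounding file contents are assumed, not taken from the thread. -->
<configuration>
  <property>
    <name>plugin.includes</name>
    <!-- The value must stay on a single line; it is one regular expression. -->
    <value>nutch-extensionpoints|language-identifier|lib-lucene-analyzers|protocol-httpclient|urlfilter-regex|parse-(text|pdf|msword|html)|index-basic|analysis-fr|query-(basic|site|url)|summary-basic|scoring-opic</value>
  </property>
</configuration>
```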
>2. Could you perform a search with an unaccented query?
Yes, an unaccented search returns adequate results.
In addition, if I perform an accented search from a different
locale (i.e., from a location other than http://myhost:8080/nutch/fr), I do
get adequate results.
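The logged query "fr?quentes" has a single "?" where "é" was expected. One way that can happen in Java is a lossy charset conversion: encoding text into a charset that has no "é" substitutes "?". The sketch below only illustrates that general behaviour. It is not taken from Nutch, and the class and method names (CharsetLossDemo, roundTrip) are hypothetical. Whether this is what happens on the /nutch/fr page is not established in this thread.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class CharsetLossDemo {
    // Encode the text to bytes in the given charset, then decode those bytes
    // with the same charset. Characters the charset cannot represent are
    // replaced by the charset's replacement byte during encoding.
    static String roundTrip(String text, Charset charset) {
        return new String(text.getBytes(charset), charset);
    }

    public static void main(String[] args) {
        // "fréquentes", written with a Unicode escape so the source file
        // encoding does not matter.
        String query = "fr\u00e9quentes";

        // US-ASCII has no 'é', so it is replaced: prints fr?quentes
        System.out.println(roundTrip(query, StandardCharsets.US_ASCII));

        // ISO-8859-1 and UTF-8 can both represent 'é': each prints true
        System.out.println(roundTrip(query, StandardCharsets.ISO_8859_1).equals(query));
        System.out.println(roundTrip(query, StandardCharsets.UTF_8).equals(query));
    }
}
```

If something similar occurs between the browser and the searcher, the accented term would no longer match the indexed tokens, while unaccented queries would be unaffected.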
___________________________________________
François McNeil
"Jérôme Charron" <je...@gmail.com>
2006-12-13 17:01
Please reply to nutch-user
To: nutch-user@lucene.apache.org
cc:
Subject: Re: NUTCH 0.8.1: Difficulties with Analyzers
> org.apache.nutch.searcher.NutchBean query: fr?quentes
François, two basic points I would like to check first:
1. Were the documents indexed with the French analyzer activated?
2. Could you perform a search with an unaccented query?
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/