You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Roger Marin <rs...@gmail.com> on 2010/08/25 17:29:54 UTC

How do I know which analyzer nutch is using during crawling/indexing?

Hello,

I implemented a plugin to change the default nutch analyzer to use the
snowball analyzer since I need stemming support, based on the french
analyzer.
When I search and the query gets parsed I can see the terms are getting
stemmed, however the search returns nothing, so I'm not sure what analyzer
nutch is using during indexing,
it doesn't seem to be picking up the analyzer from the plugin, is there a
way to get the analyzer nutch is using to index?

Thanks.

Re: How do I know which analyzer nutch is using during crawling/indexing?

Posted by Hannu Väisänen <Ha...@uef.fi>.
On Wed, Aug 25, 2010 at 11:29:54AM -0400, Roger Marin wrote:
> I implemented a plugin to change the default nutch analyzer to use the
> snowball analyzer since I need stemming support, based on the french
>
> [Query] doesn't seem to be picking up the analyzer from the plugin,


You should write your own QueryFilter that uses the same analyzer you
use for indexing and define it in your plugin.xml file.

<plugin>
   <extension id="...
              name="...
              point="org.apache.nutch.searcher.QueryFilter">
      <implementation id="YourQueryFilter"
                      class="your.package.name.YourQueryFilter">
      </implementation>
   </extension>
</plugin>