You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/05/27 00:56:26 UTC
[Nutch Wiki] Update of "MultiLingualSupport" by KurosakaTeruhiko
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by KurosakaTeruhiko:
http://wiki.apache.org/nutch/MultiLingualSupport
The comment on the change is:
Added notes, based on observation of Nutch 0.8 candidate code.
------------------------------------------------------------------------------
}}}
so that, the {{{IndexWriter}}} is called with the good {{{Analyzer}}} implementation.
+ (Note by KurosakaTeruhiko) This seems to have been implemented in Nutch 0.8. The following lines were found in {{{Indexer}}}, not {{{IndexSegment}}} which no longer exists in Nutch 0.8:
+ {{{
+ final AnalyzerFactory factory = new AnalyzerFactory(job);
+ .
+ .
+ .
+ NutchAnalyzer analyzer = factory.get(doc.get("lang"));
+ }}}
Second, the {{{NutchDocumentAnalyzer}}} class must implement the {{{NutchAnalyzer}}} class.
+ (Note by KurosakaTeruhiko) This has been done in Nutch 0.8.
== Query Analysis ==