You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/05/27 00:56:26 UTC

[Nutch Wiki] Update of "MultiLingualSupport" by KurosakaTeruhiko

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by KurosakaTeruhiko:
http://wiki.apache.org/nutch/MultiLingualSupport

The comment on the change is:
Added notes, based on observation of Nutch 0.8 candidate code.

------------------------------------------------------------------------------
  }}}
  so that, the {{{IndexWriter}}} is called with the good {{{Analyzer}}} implementation.
   
+   (Note by KurosakaTeruhiko) This seems to have been implemented in Nutch 0.8.  The following lines were found in {{{Indexer}}}, not {{{IndexSegment}}} which no longer exists in Nutch 0.8:
+   {{{
+ final AnalyzerFactory factory = new AnalyzerFactory(job);
+  .
+  .
+  .
+ NutchAnalyzer analyzer = factory.get(doc.get("lang"));
+   }}}
   
  Second, the {{{NutchDocumentAnalyzer}}} class must implement the {{{NutchAnalyzer}}} class.
+   (Note by KurosakaTeruhiko) This has been done in Nutch 0.8.  
  
  == Query Analysis ==