You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@jackrabbit.apache.org by Apache Wiki <wi...@apache.org> on 2009/08/11 09:17:19 UTC

[Jackrabbit Wiki] Update of "Search" by ThomasMueller

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Jackrabbit Wiki" for change notification.

The following page has been changed by ThomasMueller:
http://wiki.apache.org/jackrabbit/Search

------------------------------------------------------------------------------
   * Define index aggregates, rules and scores: ["IndexingConfiguration"]
   * Check spelling of a fulltext query statement: ["SpellChecker"]
  
+ == Fulltext Indexing of Chinese, Japanese and Korea ==
+ 
+ To index documents written in one of those languages, use the analyzer {{{org.apache.lucene.analysis.cjk.CJKAnalyzer}}}. Due to a limitation of PDFBox, some PDF files may not be indexed at all or indexed correctly. If this is the case, a warning message is written to the log file ("Failed to extract PDF text content").
+ 
  == Further Development ==
  
   * ["ReduceMemOfSharedFieldCache"]