You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@jackrabbit.apache.org by Andrej Kuročenko <ku...@gmail.com> on 2012/01/21 10:32:59 UTC

SearchIndex, textFilterClasses deprecated

Hi,

I'm configuring Jackrabbit 2.3.6 and I need to index binary files (PDF, 
ODT). So I've configured SearchIndex in repository.xml according to 
http://wiki.apache.org/jackrabbit/Search. But when I insert file (like 
here http://stackoverflow.com/a/5169893/816456), then try to full-text 
search, no results are returned.

Then I noticed warning in logs:
SearchIndex.java:2087  The textFilterClasses configuration parameter has 
been deprecated, and the configured value will be ignored: 
org.apache.jackrabbit.extractor.PlainTextExtractor,org.apache.jackrabbit.extractor.PdfTextExtractor,org.apache.jackrabbit.extractor.OpenOfficeTextExtractor

How do I have to configure SearchIndex to index binary data? Now I am 
doing it like this:

<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
<param name="path" value="${rep.home}/repository/index"/>
<param name="textFilterClasses" 
value="org.apache.jackrabbit.extractor.PdfTextExtractor,org.apache.jackrabbit.extractor.OpenOfficeTextExtractor"/>
<param name="supportHighlighting" value="true"/>
</SearchIndex>

Thanks for replies,

Andrej.