
Posted to users@jackrabbit.apache.org by stefan81 <st...@process-relations.com> on 2007/09/11 13:06:46 UTC

Re: Problem with content search

I found that documents saved in the new Office 2007 formats (docx,
xlsx) are not recognized by the indexer (at least in JR version 1.3). Maybe
this is the cause of your problem. My JBoss log then reports:

[NodeIndexer] Exception while indexing binary property: java.io.IOException:
Invalid header signature; read 1688935826934608, expected
-2226271756974174256
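
The two numbers in that message can be decoded to see what the indexer was handed. The sketch below is only illustrative: the class and method names are hypothetical, and it assumes the values are the first 8 bytes of the file read as a little-endian long, which is how an OLE2 (old .doc/.xls) reader would check its magic number. Under that assumption the "expected" value is the OLE2 signature and the "read" value starts with "PK", the ZIP magic that docx and xlsx packages begin with.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

public class HeaderSignature {
    // OLE2 compound-document magic (bytes D0 CF 11 E0 A1 B1 1A E1),
    // written here as the little-endian long those bytes produce.
    public static final long OLE2_SIGNATURE = 0xE11AB1A1E011CFD0L;

    // Interprets the first 8 bytes of a header as a little-endian long.
    public static long readSignature(byte[] header) {
        return ByteBuffer.wrap(header, 0, 8).order(ByteOrder.LITTLE_ENDIAN).getLong();
    }

    // Converts a logged signature value back into the 8 bytes it was read from.
    public static byte[] toLittleEndianBytes(long value) {
        return ByteBuffer.allocate(8).order(ByteOrder.LITTLE_ENDIAN).putLong(value).array();
    }

    // True when the header starts with the ZIP local-file-header magic "PK\3\4".
    public static boolean looksLikeZip(byte[] header) {
        return header.length >= 4
                && header[0] == 'P' && header[1] == 'K'
                && header[2] == 3 && header[3] == 4;
    }

    public static void main(String[] args) {
        long read = 1688935826934608L;          // "read" value from the log
        long expected = -2226271756974174256L;  // "expected" value from the log

        // The expected value is the OLE2 magic number.
        System.out.println("expected is OLE2 magic: " + (expected == OLE2_SIGNATURE));

        // The value actually read decodes to bytes beginning with 'P' 'K' 3 4.
        byte[] header = toLittleEndianBytes(read);
        System.out.println("read looks like ZIP: " + looksLikeZip(header));
    }
}
```

If that reading is right, the extractor registered for application/msword is being given a ZIP-based file it cannot parse, which would fit the observation that only the new formats fail.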

When documents are stored under the old formats the indexing works. Are the
new formats supported by newer JR versions? Are there any efforts?

Regards

Jukka Zitting wrote:
> 
> Hi,
> 
> On 3/13/07, Malligarjunan Sidduraj
> <Ma...@webmethods.com> wrote:
>> Content of the jcr:content/jcr:mimeType property is application/msword
> 
> That should be OK. It could be that the MS Word indexer is having some
> trouble parsing the document. Do you see some errors being logged by
> the repository?
> 
> BR,
> 
> Jukka Zitting
> 
> 


Re: Problem with content search

Posted by Marcel Reutegger <ma...@gmx.net>.

stefan81 wrote:
> When documents are stored under the old formats the indexing works. Are the
> new formats supported by newer JR versions?

No, Jackrabbit does not support docx or xlsx.

> Are there any efforts?

No, not currently. But if you create a JIRA issue, someone might start to work
on it ;)

regards
  marcel