You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Otis Gospodnetic <ot...@yahoo.com> on 2011/06/04 03:56:48 UTC

Re: Nutch Crawl error

Roger, wrong list.

Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



----- Original Message ----
> From: Roger Shah <rs...@caci.com>
> To: "solr-user@lucene.apache.org" <so...@lucene.apache.org>
> Sent: Thu, May 26, 2011 3:06:15 PM
> Subject: Nutch Crawl error
> 
> I ran the command bin/nutch crawl urls -dir crawl -depth 3 >&  crawl.log
> 
> When I viewed crawl.log I found some errors such  as:
> 
> Can't retrieve Tika parser for  mime-typeapplication/x-shockwave-flash, and 
>some other similar messages for  other types such as application/xml, etc.
> 
> Do I need to download Tika for  these errors to go away?  Where can I download 
>Tika so that it can work  with Nutch?  If there are instructions to install Tika 
>to work with Nutch  please send them to me.
> 
> Thanks,
> Roger
>