You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Otis Gospodnetic <ot...@yahoo.com> on 2011/06/04 03:56:48 UTC
Re: Nutch Crawl error
Roger, wrong list.
Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
----- Original Message ----
> From: Roger Shah <rs...@caci.com>
> To: "solr-user@lucene.apache.org" <so...@lucene.apache.org>
> Sent: Thu, May 26, 2011 3:06:15 PM
> Subject: Nutch Crawl error
>
> I ran the command bin/nutch crawl urls -dir crawl -depth 3 >& crawl.log
>
> When I viewed crawl.log I found some errors such as:
>
> Can't retrieve Tika parser for mime-typeapplication/x-shockwave-flash, and
>some other similar messages for other types such as application/xml, etc.
>
> Do I need to download Tika for these errors to go away? Where can I download
>Tika so that it can work with Nutch? If there are instructions to install Tika
>to work with Nutch please send them to me.
>
> Thanks,
> Roger
>