You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Omar <or...@yahoo.com> on 2006/10/02 21:01:48 UTC

A couple of questions with 0.8.1

Hello,

I have Nutch up & running. I want to index URLs that contain pdf and MS Word
but seems like it didn't work. The log says:

Indexing [http://my-site/doc.pdf] with analyzer
org.apache.nutch.analysis.NutchDocumentAnalyzer@52c6b4 (null)

What does it mean? It failed to index the pdf? Why?

Also, if I want to re-crawl my site what's the command for it? 

Thanks!

Omar
-- 
View this message in context: http://www.nabble.com/A-couple-of-questions-with-0.8.1-tf2371850.html#a6607701
Sent from the Nutch - User mailing list archive at Nabble.com.