You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Omar <or...@yahoo.com> on 2006/10/02 21:01:48 UTC
A couple of questions with 0.8.1
Hello,
I have Nutch up & running. I want to index URLs that contain pdf and MS Word
but seems like it didn't work. The log says:
Indexing [http://my-site/doc.pdf] with analyzer
org.apache.nutch.analysis.NutchDocumentAnalyzer@52c6b4 (null)
What does it mean? It failed to index the pdf? Why?
Also, if I want to re-crawl my site what's the command for it?
Thanks!
Omar
--
View this message in context: http://www.nabble.com/A-couple-of-questions-with-0.8.1-tf2371850.html#a6607701
Sent from the Nutch - User mailing list archive at Nabble.com.