You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by dan <da...@gmail.com> on 2012/01/31 11:56:52 UTC

Does nutch give the ability to parse and save file headers?

I'm building a search app using nutch. 
I'll use nutch to crawl both local file system and internet.
I need to later on have the file headers fields searchable.
So:
1. when I'm crawling internet, does nutch has a built-in abillity to save
configurable http response fields?
2. when i'm crawling local file system, does nutch has a built-in abillity
to save file header fields, like date-created, date-modified, owner?

Thanks.

--
View this message in context: http://lucene.472066.n3.nabble.com/Does-nutch-give-the-ability-to-parse-and-save-file-headers-tp3702884p3702884.html
Sent from the Nutch - User mailing list archive at Nabble.com.