Posted to user@nutch.apache.org by Alfredas Chmieliauskas <al...@gmail.com> on 2011/10/06 09:56:12 UTC

Https and fetch reject

Dear all,

I'm trying to crawl and index my svn repository via https. I managed to get
basic authentication going by following the manual, and it's working. The
problem now seems to be that the fetcher refuses to index pages because the
current time (currtime) is earlier than the fetch time (fetchtime).
It seems that the page is recorded the first time it is fetched, with a 401
response. But maybe I'm wrong.
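
To make my guess concrete, here is a minimal Python sketch of the behaviour I
suspect. It is not Nutch's actual code: the names PageRecord, record_fetch and
is_due, the example URL, and the 30-day interval are all assumptions made up
for illustration. The idea is that the first attempt (even a 401) stores a
next fetch time in the future, so a later run skips the page until then.

```python
from dataclasses import dataclass


# Hypothetical stand-in for a crawl database entry; not Nutch's real class.
@dataclass
class PageRecord:
    url: str
    status: int      # HTTP status seen on the last fetch attempt
    fetch_time: int  # epoch seconds at which the page is next due


def record_fetch(url, status, now, interval):
    """Store the attempt and schedule the next one `interval` seconds later."""
    return PageRecord(url=url, status=status, fetch_time=now + interval)


def is_due(record, cur_time):
    """A page is selected only once its scheduled fetch time has been reached."""
    return cur_time >= record.fetch_time


INTERVAL = 30 * 24 * 3600  # assumed re-fetch interval (30 days), illustrative only

first_attempt = 1_317_887_772  # arbitrary epoch seconds for the first run
rec = record_fetch("https://svn.example.org/repo/", 401, first_attempt, INTERVAL)

# One hour later, with credentials configured, the page is still not due.
print(is_due(rec, first_attempt + 3600))      # False
print(is_due(rec, first_attempt + INTERVAL))  # True
```

If this is what is going on, the 401 entry would have to be reset or
re-injected before the page gets fetched again with credentials.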

Please see the log attached.

Has anyone had any success crawling https (or svn via https)?

I would appreciate any help.

Alfredas