Posted to user@nutch.apache.org by Max Dzyuba <ma...@comintelli.com> on 2012/09/19 16:37:08 UTC

HTTP Authentication (basic) in Nutch 1.5

Hi all,

 

I'm testing whether Nutch can get past HTTP Basic authentication, and I cannot
get it to work. I followed the guide here:

http://wiki.apache.org/nutch/HttpAuthenticationSchemes

 

My httpclient-auth.xml file looks like this:

 

<auth-configuration>
  <credentials username="username" password="password">
    <authscope host="host.org" port="80" realm="realm" scheme="basic" />
    <authscope host="host.org" port="8080"/>
  </credentials>
  <credentials username="username" password="password">
    <default/>
  </credentials>
</auth-configuration>

 

The log file has these records:

 

2012-09-19 16:26:16,106 DEBUG auth.AuthChallengeProcessor - Supported authentication schemes in the order of preference: [ntlm, digest, basic]
2012-09-19 16:26:16,106 DEBUG auth.AuthChallengeProcessor - Challenge for ntlm authentication scheme not available
2012-09-19 16:26:16,106 DEBUG auth.AuthChallengeProcessor - Challenge for digest authentication scheme not available
2012-09-19 16:26:16,106 INFO  auth.AuthChallengeProcessor - basic authentication scheme selected
2012-09-19 16:26:16,106 DEBUG auth.AuthChallengeProcessor - Using authentication scheme: basic
2012-09-19 16:26:16,106 DEBUG auth.AuthChallengeProcessor - Authorization challenge processed
2012-09-19 16:26:16,106 INFO  httpclient.HttpMethodDirector - No credentials available for BASIC 'realm'@host.org:80

 

I don't understand why Nutch complains about "No credentials available for
BASIC 'realm'@host.org:80" since I've set up the default credentials which
should be used for any page that asks for authentication.

 

Can anyone point me to what else I should check?

 

 

Thanks a lot in advance,

Max


Re: HTTP Authentication (basic) in Nutch 1.5

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi,

On Wed, Sep 19, 2012 at 3:37 PM, Max Dzyuba <ma...@comintelli.com> wrote:

>
> 2012-09-19 16:26:16,106 INFO  httpclient.HttpMethodDirector - No credentials
> available for BASIC 'realm'@host.org:80
>
>
>
> I don't understand why Nutch complains about "No credentials available for
> BASIC 'realm'@host.org:80" since I've set up the default credentials which
> should be used for any page that asks for authentication.
>

If I follow the above link I get a popup box saying that the site does
not require authentication credentials and that it is trying to trick
me.

Are you sure it's not just this site, and that another solution is required?

Lewis
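
One way to test whether the site really issues a Basic challenge, and with which realm, is to send an unauthenticated request and look at the WWW-Authenticate header of the response. The following is only a rough sketch using the Python standard library, not anything that ships with Nutch; the function names parse_challenge and fetch_challenge are made up for illustration, and the URL in the usage comment is the placeholder host from the thread. It also assumes the server sends a single challenge with a quoted realm.

```python
import re
import urllib.error
import urllib.request


def parse_challenge(header_value):
    """Split a WWW-Authenticate value such as 'Basic realm="realm"'
    into (scheme, realm). The realm is None when it is absent."""
    scheme, _, params = header_value.strip().partition(" ")
    match = re.search(r'realm="([^"]*)"', params, re.IGNORECASE)
    realm = match.group(1) if match else None
    return scheme.lower(), realm


def fetch_challenge(url):
    """Send an unauthenticated GET and return (status, scheme, realm).
    scheme and realm are None when the server sends no challenge."""
    try:
        with urllib.request.urlopen(url, timeout=10) as response:
            # A 2xx answer means no authentication was requested.
            return response.status, None, None
    except urllib.error.HTTPError as err:
        header = err.headers.get("WWW-Authenticate")
        if header is None:
            return err.code, None, None
        scheme, realm = parse_challenge(header)
        return err.code, scheme, realm


# Hypothetical usage (placeholder host from the thread):
# print(fetch_challenge("http://host.org/"))
```

If the status is not 401, or the scheme and realm differ from what the authscope element in httpclient-auth.xml names, that would be a first place to look.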