Posted to dev@nutch.apache.org by "Ravi Chintakunta (JIRA)" <ji...@apache.org> on 2007/08/08 18:44:59 UTC

[jira] Created: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

HttpClient plugin does not work with BasicAuthentication
--------------------------------------------------------

                 Key: NUTCH-539
                 URL: https://issues.apache.org/jira/browse/NUTCH-539
             Project: Nutch
          Issue Type: Bug
          Components: fetcher
    Affects Versions: 0.8
            Reporter: Ravi Chintakunta
            Priority: Minor


For Nutch to fetch pages protected by HTTP basic authentication, HttpClient must be configured with the username and password credentials.

For this to work:

1. Add the username and password credentials to nutch-site.xml as below:

<property>
  <name>http.auth.basic.username</name>
  <value>myusername</value>
  <description>
    Username for HTTP basic auth
  </description>
</property>

<property>
  <name>http.auth.basic.password</name>
  <value>mypassword</value>
  <description>
    Password for HTTP basic auth
  </description>
</property>

2. Configure httpclient with these credentials by applying the attached patch to nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java
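
For readers unfamiliar with what the credentials from step 1 turn into on the wire, here is a minimal sketch of how an HTTP basic authentication header is formed: the username and password are joined with a colon and Base64-encoded. This is not the attached patch and does not use the HttpClient API; the class and method names (BasicAuthSketch, basicAuthHeader) are hypothetical, and only JDK classes are used.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper for illustration; not part of Nutch or the attached patch.
class BasicAuthSketch {

    // Builds the Authorization header value for HTTP basic auth:
    // "Basic " followed by base64(username ":" password).
    static String basicAuthHeader(String username, String password) {
        String pair = username + ":" + password;
        String encoded = Base64.getEncoder()
                .encodeToString(pair.getBytes(StandardCharsets.UTF_8));
        return "Basic " + encoded;
    }

    public static void main(String[] args) {
        // Placeholder values matching the nutch-site.xml snippet in step 1.
        System.out.println(basicAuthHeader("myusername", "mypassword"));
    }
}
```

Once HttpClient has been given the credentials, it is expected to produce this header itself when a server asks for basic authentication, so the sketch is only meant to show why both properties are needed.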




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

Posted by "Ravi Chintakunta (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Chintakunta updated NUTCH-539:
-----------------------------------

    Attachment: httpclient_http.patch

Patch for nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java to fix this issue



[jira] Commented: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

Posted by "Alexis Votta (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12530165 ] 

Alexis Votta commented on NUTCH-539:
------------------------------------

This patch doesn't work for me. I don't get any useful debug logs either.



[jira] Issue Comment Edited: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

Posted by "Susam Pal (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12530175 ] 

susam edited comment on NUTCH-539 at 9/25/07 10:54 AM:
-----------------------------------------------------------

1. There is a bug in the patch. The domain name is hardcoded in the AuthScope constructor. It should be the domain name extracted from the URL being fetched.
2. This patch removes NTLM authentication scheme support from protocol-httpclient. Both NTLM and digest authentication schemes should be supported.

These two issues have been fixed in NUTCH-559.
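
To illustrate point 1, here is a minimal sketch of deriving the host and port from the URL being fetched instead of hardcoding them. It is not the code from NUTCH-559 and does not construct an AuthScope; the class and method names (AuthScopeSketch, scopeFor) and the example URLs are hypothetical, and only JDK classes are used.

```java
import java.net.URI;
import java.util.Locale;

// Hypothetical helper for illustration; not the code from NUTCH-559.
class AuthScopeSketch {

    // Returns "host:port" for the URL being fetched, filling in the
    // default port for http/https when the URL does not name one.
    static String scopeFor(String url) {
        URI uri = URI.create(url);
        String host = uri.getHost();
        if (host == null) {
            throw new IllegalArgumentException("URL has no host: " + url);
        }
        int port = uri.getPort();
        if (port == -1) {
            port = "https".equalsIgnoreCase(uri.getScheme()) ? 443 : 80;
        }
        return host.toLowerCase(Locale.ROOT) + ":" + port;
    }

    public static void main(String[] args) {
        // Made-up URLs; each one yields its own scope.
        System.out.println(scopeFor("http://intranet.example.com/docs/index.html"));
        System.out.println(scopeFor("https://example.org:8443/private/"));
    }
}
```

Scoping credentials to the host taken from each URL means they are only offered to the server actually being fetched, which is the behaviour point 1 asks for.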

  


[jira] Closed: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

Posted by "Doğacan Güney (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doğacan Güney closed NUTCH-539.
-------------------------------

       Resolution: Fixed
    Fix Version/s: 1.0.0
         Assignee: Doğacan Güney

Fixed as part of NUTCH-559.
