You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Fabio Santagostino (JIRA)" <ji...@apache.org> on 2015/02/15 21:03:12 UTC

[jira] [Commented] (NUTCH-1086) Rewrite protocol-httpclient

    [ https://issues.apache.org/jira/browse/NUTCH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14322166#comment-14322166 ] 

Fabio Santagostino commented on NUTCH-1086:
-------------------------------------------

Hi,
I've done an attempt to rewrite the component using httpclient 4.4. It works for me !
My main goal was to use a correct implementation of NTLMv2 auhentication for my corporate web sites.
Anyway it seams to be backward compatible  with previous implementation. Proxy support is the only part I've not tested yet.

I had to change only 2 classes (in attachment)  :
- /src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java
- /src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpResponse.java


Of course package dependency files must be modified also. In /ivy/ivy.xml :

+ added httpclient 4.4 version
{code:xml}
  <dependency org="org.apache.httpcomponents" name="httpclient" rev="4.4" conf="*->master" />
{code}

+ updated codec version from {code:xml}<dependency org="commons-codec" name="commons-codec" rev="1.3" conf="*->default" />{code}  to {code:xml}<dependency org="commons-codec" name="commons-codec" rev="1.4" conf="*->default" />{code}

Files in attachment are tested for v1.9 branch, but probably minor changes are needed to make it suitable for v2.3.

Regards,
Fabio

> Rewrite protocol-httpclient
> ---------------------------
>
>                 Key: NUTCH-1086
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1086
>             Project: Nutch
>          Issue Type: Improvement
>          Components: protocol
>    Affects Versions: nutchgora, 1.5
>            Reporter: Markus Jelsma
>             Fix For: 2.4
>
>         Attachments: Http.java, HttpResponse.java
>
>
> There are several issues about protocol-httpclient and several comments about rewriting the plugin with the new http client libraries. There is, however, not yet an issue for rewriting/reimplementing protocol-httpclient.
> http://hc.apache.org/httpcomponents-client-ga/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)