Posted to dev@nutch.apache.org by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/04 15:08:34 UTC

[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

    [ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13406495#comment-13406495 ] 

Ferdy Galema commented on NUTCH-1360:
-------------------------------------

Sorry for the late response, but this issue is not properly implemented (in both the branch and trunk).

- The IP is always stored instead of depending on the property: headers.set("_ip",... should be done only if http.getIP_Header() returns true.

- http.store.ip.address appends the _ip:<true or false> property to the request string. What is the purpose of that? If it is not intentional, we should simply revert it. On top of that, the property is read with a default of "true", but the default should be "false" if appending it to the request string is intentional.
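
To illustrate the first point, here is a minimal sketch of the intended behaviour, not the actual patch. It assumes the flag is read from http.store.ip.address with a default of "false" and that the "_ip" header is set only when the flag is on. The class IpHeaderSketch, its methods, and the use of java.util.Properties and a plain Map in place of the Nutch configuration and metadata classes are all hypothetical stand-ins.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;

public class IpHeaderSketch {

  // Property name taken from the comment above.
  static final String STORE_IP_PROPERTY = "http.store.ip.address";

  // Hypothetical helper: reads the flag, defaulting to false so that
  // the IP is stored only when it has been explicitly enabled.
  static boolean storeIpEnabled(Properties conf) {
    return Boolean.parseBoolean(conf.getProperty(STORE_IP_PROPERTY, "false"));
  }

  // Hypothetical stand-in for the response handling code: the "_ip"
  // header is set only when the flag is on, never unconditionally.
  static Map<String, String> responseHeaders(Properties conf, String connectedIp) {
    Map<String, String> headers = new HashMap<>();
    if (storeIpEnabled(conf)) {
      headers.put("_ip", connectedIp);
    }
    return headers;
  }

  public static void main(String[] args) {
    Properties conf = new Properties();

    // Flag unset: no "_ip" entry is added.
    System.out.println(responseHeaders(conf, "192.0.2.10"));

    // Flag enabled: the connected IP is stored under "_ip".
    conf.setProperty(STORE_IP_PROPERTY, "true");
    System.out.println(responseHeaders(conf, "192.0.2.10"));
  }
}
```

Nothing in this sketch touches the request string; the flag only decides whether the IP is recorded alongside the response.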

Thanks.

                
> Suport the storing of IP address connected to when web crawling
> ---------------------------------------------------------------
>
>                 Key: NUTCH-1360
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1360
>             Project: Nutch
>          Issue Type: New Feature
>          Components: protocol
>    Affects Versions: nutchgora, 1.5
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: nutchgora, 1.6
>
>         Attachments: NUTCH-1360-nutchgora-v2.patch, NUTCH-1360-nutchgora.patch, NUTCH-1360-trunk.patch
>
>
> Simple issue enabling us to capture the specific IP address of the host which we connect to to fetch a page.
