You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/11/22 15:24:00 UTC

[jira] [Resolved] (NUTCH-2126) Use selenium protocol for specific sites

     [ https://issues.apache.org/jira/browse/NUTCH-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Nagel resolved NUTCH-2126.
------------------------------------
    Fix Version/s: 1.16
       Resolution: Duplicate

This has been implemented in 1.16, see NUTCH-2678.

> Use selenium protocol for specific sites
> ----------------------------------------
>
>                 Key: NUTCH-2126
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2126
>             Project: Nutch
>          Issue Type: Sub-task
>          Components: fetcher
>            Reporter: Asitang Mishra
>            Priority: Major
>             Fix For: 1.16
>
>
> Right now if one uses selenium or seleniuminteractive plugins. The fetcher uses them for all the fetches. There will be situations where we don't want to go through the overhead of using selenium for all the seeds. 
> Can provide some standardized key value pairs tell the protocol recognizer in nutch that certain seeds will be used with selenium plugin. Later on we can keep appending these key value pairs to the outlinks or only outlinks that are of the same domain.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)