You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Filip Stysiak <st...@gmail.com> on 2017/05/30 14:50:52 UTC

Configuring protocol-selenium

Hello everyone,

I want to test using Nutch with protocol-selenium plugin and I follow the
instructions on Github page of the plugin:
https://github.com/apache/nutch/tree/trunk/src/plugin/protocol-selenium

however, I think I did something wrong, since after starting selenium hub
in one terminal and starting a selenium node in the other and changing
protocol-http to protocol-selenium in nutch-site.xml configuration property
(plugin.includes) nothing gets fetched.

I left the selenium-plugin configuration the way they are on the github
site and I feel I should configure at least a bunch of them. Which ones?
Does anyone have experience with protocol-selenium setup? Is the "run
headless" part crucial?