You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2023/02/18 14:58:00 UTC
[jira] [Commented] (NUTCH-2980) Upgrade Selenium Java to 4.7.2
[ https://issues.apache.org/jira/browse/NUTCH-2980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17690763#comment-17690763 ]
ASF GitHub Bot commented on NUTCH-2980:
---------------------------------------
sebastian-nagel commented on PR #753:
URL: https://github.com/apache/nutch/pull/753#issuecomment-1435694153
Finally, I was able to successfully test it - the reason was that on recent Ubuntu systems Firefox and Chromium are installed as snap packages. This adds extra sandboxing and requires that `TMPDIR` points to a folder the snap packages are allowed to write to (they cannot write to the default `/tmp/`).
Thanks, @KamilMroczek !
> Upgrade Selenium Java to 4.7.2
> ------------------------------
>
> Key: NUTCH-2980
> URL: https://issues.apache.org/jira/browse/NUTCH-2980
> Project: Nutch
> Issue Type: Improvement
> Components: plugin, protocol
> Affects Versions: 1.19
> Reporter: Kamil Mroczek
> Priority: Major
> Fix For: 1.20
>
>
> Selenium version is quite old and had some issues processing a website. Once I switched to the latest version I was able to scrape that websites. Good to keep it up to date since we were already 1 major release behind.
> Upgrading Selenium Java from 3.141.59 to 4.7.2 and Selenium HTMLUnit from 2.35.1 to 4.7.0.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)