You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (Jira)" <ji...@apache.org> on 2024/02/19 16:39:00 UTC
[jira] [Commented] (NUTCH-3030) Update default TLS cipher suites for http(s) protocol
[ https://issues.apache.org/jira/browse/NUTCH-3030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17818529#comment-17818529 ]
Markus Jelsma commented on NUTCH-3030:
--------------------------------------
Nice!
[https://www.afas.nl/] has interesting SSL settings, not allowing many different cipher suites. Nutch' out of the box settings cause a fetch for this site to fail.
Relying on what the system has to offer fixes the issue without manually listing available cipher suites.
> Update default TLS cipher suites for http(s) protocol
> -----------------------------------------------------
>
> Key: NUTCH-3030
> URL: https://issues.apache.org/jira/browse/NUTCH-3030
> Project: Nutch
> Issue Type: Improvement
> Affects Versions: 1.19
> Reporter: Martin Djukanovic
> Assignee: Markus Jelsma
> Priority: Minor
> Attachments: default_ciphers_and_protocols-2.patch
>
>
> If http.tls.supported.cipher.suites is not set in the configuration, it defaults to a hard-coded list which is not exhaustive enough. I have encountered websites that exclusively use ciphers which are not included, so they could not be handled by protocol-http.
> I changed this list to the system default -- SSLSocketFactory's .getDefaultCipherSuites() to be precise. One could also use .getSupportedCipherSuites() here, I suppose.
> The original list should be moved to nutch-default.xml or omitted altogether. The protocol list is still hard-coded, but it is now also added to nutch-default.xml (so it can be easily changed manually if needed).
--
This message was sent by Atlassian Jira
(v8.20.10#820010)