You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by sn...@apache.org on 2017/12/18 16:29:53 UTC
[nutch] branch master updated (dd94a61 -> c6e5dfb)
This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from dd94a61 NUTCH-2380 Upgrade indexer-elastic to Elasticsearch version 5.3.0 (contributed by Jurian Broertjes)
add 7cc622e NUTCH-2365 Fetcher to respect db.ignore.external.links.mode for redirects - restructure method handleRedirects: result of URL filters is checked early - simplify debug logging calls
add 856e551 NUTCH-2216 db.ignore.*.links to optionally follow internal redirects - add option db.ignore.also.redirects which allows to follow redirects even if internal/external links are not followedNUTCH-2216
new c6e5dfb Merge pull request #264 from sebastian-nagel/nutch-2365-fetcher-redirects-mode
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
conf/nutch-default.xml | 14 ++-
.../org/apache/nutch/fetcher/FetcherThread.java | 115 +++++++++++----------
2 files changed, 72 insertions(+), 57 deletions(-)
--
To stop receiving notification emails like this one, please contact
['"commits@nutch.apache.org" <co...@nutch.apache.org>'].
[nutch] 01/01: Merge pull request #264 from
sebastian-nagel/nutch-2365-fetcher-redirects-mode
Posted by sn...@apache.org.
This is an automated email from the ASF dual-hosted git repository.
snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git
commit c6e5dfb3d2f430d9b899a273515f58c093295baa
Merge: dd94a61 856e551
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Mon Dec 18 17:29:51 2017 +0100
Merge pull request #264 from sebastian-nagel/nutch-2365-fetcher-redirects-mode
NUTCH-2365 Fetcher to respect db.ignore.external.links.mode for redirects
NUTCH-2216 db.ignore.*.links to optionally follow internal redirects
conf/nutch-default.xml | 14 ++-
.../org/apache/nutch/fetcher/FetcherThread.java | 115 +++++++++++----------
2 files changed, 72 insertions(+), 57 deletions(-)
--
To stop receiving notification emails like this one, please contact
"commits@nutch.apache.org" <co...@nutch.apache.org>.