Posted to commits@nutch.apache.org by sn...@apache.org on 2018/04/21 16:34:35 UTC
[nutch] branch master updated (60a41ef -> d104882)
This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from 60a41ef Merge pull request #315 from sebastian-nagel/NUTCH-2552-deprecated-properties-new-map-reduce-api
add 8ed7d4f NUTCH-2553 Fetcher not to modify URLs to be fetched
    - fix bug in fetcher.QueueFeeder which caused the same key-value pair to be overwritten again and again (Hadoop object reuse)
    - simplify URL handling in FetcherThread: hold URLs exclusively in FetchItem
    - parametrize log messages and remove unused imports and variables
add 3a6c23a NUTCH-2553 Add comment why objects are copied in QueueFeeder
new d104882 Merge pull request #317 from sebastian-nagel/NUTCH-2553-fetcher-object-reuse
The 1 revision listed above as "new" is entirely new to this
repository and will be described in a separate email. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
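
For readers unfamiliar with the "Hadoop object reuse" problem named in the NUTCH-2553 commit message, here is a minimal sketch of the general pitfall. It is not the Nutch code and does not use Hadoop classes: MutableUrl, reusingReader, queueWithoutCopy and queueWithCopy are hypothetical names invented for this illustration. The assumption is only that a reader hands out the same mutable instance for every record, so a queue that stores references without copying ends up holding the last record many times.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ObjectReuseSketch {

    /** Hypothetical mutable record, standing in for a reusable key or value object. */
    public static final class MutableUrl {
        private String value = "";
        public void set(String v) { value = v; }
        public String get() { return value; }
        /** Returns an independent copy that later set() calls cannot affect. */
        public MutableUrl copy() {
            MutableUrl c = new MutableUrl();
            c.set(value);
            return c;
        }
    }

    /** Hypothetical reader that reuses ONE instance for every record it returns. */
    public static Iterator<MutableUrl> reusingReader(List<String> urls) {
        final MutableUrl shared = new MutableUrl();
        final Iterator<String> it = urls.iterator();
        return new Iterator<MutableUrl>() {
            @Override public boolean hasNext() { return it.hasNext(); }
            @Override public MutableUrl next() {
                shared.set(it.next()); // overwrites the previously returned record
                return shared;
            }
        };
    }

    /** Buggy variant: queues the shared reference, so all entries alias one object. */
    public static List<String> queueWithoutCopy(List<String> urls) {
        List<MutableUrl> queue = new ArrayList<>();
        Iterator<MutableUrl> reader = reusingReader(urls);
        while (reader.hasNext()) {
            queue.add(reader.next());
        }
        return values(queue);
    }

    /** Fixed variant: copies each record before queueing it. */
    public static List<String> queueWithCopy(List<String> urls) {
        List<MutableUrl> queue = new ArrayList<>();
        Iterator<MutableUrl> reader = reusingReader(urls);
        while (reader.hasNext()) {
            queue.add(reader.next().copy());
        }
        return values(queue);
    }

    private static List<String> values(List<MutableUrl> queue) {
        List<String> out = new ArrayList<>();
        for (MutableUrl u : queue) {
            out.add(u.get());
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> urls = Arrays.asList("http://a.example/", "http://b.example/");
        System.out.println(queueWithoutCopy(urls)); // [http://b.example/, http://b.example/]
        System.out.println(queueWithCopy(urls));    // [http://a.example/, http://b.example/]
    }
}
```

Copying costs one allocation per record, which is presumably why a comment explaining the copy was worth adding in 3a6c23a; the actual change is in QueueFeeder.java in the commits listed above.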
Summary of changes:
 src/java/org/apache/nutch/fetcher/Fetcher.java     |   1 -
 .../apache/nutch/fetcher/FetcherOutputFormat.java  |   2 -
 .../org/apache/nutch/fetcher/FetcherThread.java    | 148 ++++++++++-----------
 src/java/org/apache/nutch/fetcher/QueueFeeder.java |  25 ++--
 .../org/apache/nutch/crawl/CrawlDBTestUtil.java    |  21 +--
 5 files changed, 90 insertions(+), 107 deletions(-)
--
To stop receiving notification emails like this one, please contact
snagel@apache.org.
[nutch] 01/01: Merge pull request #317 from sebastian-nagel/NUTCH-2553-fetcher-object-reuse
Posted by sn...@apache.org.
snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git
commit d10488217e3996b2263910e67a5b05711fe4b9a2
Merge: 60a41ef 3a6c23a
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Sat Apr 21 18:34:33 2018 +0200
Merge pull request #317 from sebastian-nagel/NUTCH-2553-fetcher-object-reuse
NUTCH-2553 Fetcher not to modify URLs to be fetched
 src/java/org/apache/nutch/fetcher/Fetcher.java     |   1 -
 .../apache/nutch/fetcher/FetcherOutputFormat.java  |   2 -
 .../org/apache/nutch/fetcher/FetcherThread.java    | 148 ++++++++++-----------
 src/java/org/apache/nutch/fetcher/QueueFeeder.java |  25 ++--
 .../org/apache/nutch/crawl/CrawlDBTestUtil.java    |  21 +--
 5 files changed, 90 insertions(+), 107 deletions(-)