You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by sn...@apache.org on 2018/11/14 12:04:54 UTC

[nutch] branch master updated (f443f1b -> 8151237)

This is an automated email from the ASF dual-hosted git repository.

snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.


    from f443f1b  Merge pull request #395 from sebastian-nagel/NUTCH-2655-solr-schema-7x
     add 54f156c  NUTCH-2630 Fetcher to log skipped records by robots.txt - change required log level to INFO (default) for messages   reporting skipped URLs because of robots.txt rules   (disallow or crawl delay larger than fetcher.max.crawl.delay)
     new 8151237  Merge pull request #387 from sebastian-nagel/NUTCH-2630-fetcher-log-robotstxt-denied

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 src/java/org/apache/nutch/fetcher/FetcherThread.java | 6 ++----
 1 file changed, 2 insertions(+), 4 deletions(-)


[nutch] 01/01: Merge pull request #387 from sebastian-nagel/NUTCH-2630-fetcher-log-robotstxt-denied

Posted by sn...@apache.org.
This is an automated email from the ASF dual-hosted git repository.

snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git

commit 8151237a4000972f79ee30b371cc7be1dbb10d04
Merge: f443f1b 54f156c
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Wed Nov 14 13:04:49 2018 +0100

    Merge pull request #387 from sebastian-nagel/NUTCH-2630-fetcher-log-robotstxt-denied
    
    NUTCH-2630 Fetcher to log skipped records by robots.txt

 src/java/org/apache/nutch/fetcher/FetcherThread.java | 6 ++----
 1 file changed, 2 insertions(+), 4 deletions(-)