You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by sn...@apache.org on 2018/06/12 16:04:58 UTC

[nutch] 01/01: Merge pull request #347 from sebastian-nagel/NUTCH-2549-protocol-http-fixes

This is an automated email from the ASF dual-hosted git repository.

snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git

commit 106df966444311aa7d35b443105d00173bdc4847
Merge: 4bcaeeb 2e485cf
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Tue Jun 12 18:04:42 2018 +0200

    Merge pull request #347 from sebastian-nagel/NUTCH-2549-protocol-http-fixes
    
    NUTCH-2549  protocol-http does not behave the same as browsers

 conf/nutch-default.xml                             |   6 +-
 .../org/apache/nutch/metadata/HttpHeaders.java     |   2 +
 .../nutch/metadata/SpellCheckedMetadata.java       |   7 +-
 .../apache/nutch/protocol/http/HttpResponse.java   | 161 +++++++----
 .../src/test/conf/nutch-site-test.xml              |   8 +-
 .../protocol/http/TestBadServerResponses.java      | 313 +++++++++++++++++++++
 .../urlnormalizer/basic/BasicURLNormalizer.java    |  20 +-
 .../basic/TestBasicURLNormalizer.java              |   2 +
 8 files changed, 452 insertions(+), 67 deletions(-)

-- 
To stop receiving notification emails like this one, please contact
snagel@apache.org.