You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by sn...@apache.org on 2018/06/12 16:04:58 UTC
[nutch] 01/01: Merge pull request #347 from
sebastian-nagel/NUTCH-2549-protocol-http-fixes
This is an automated email from the ASF dual-hosted git repository.
snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git
commit 106df966444311aa7d35b443105d00173bdc4847
Merge: 4bcaeeb 2e485cf
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Tue Jun 12 18:04:42 2018 +0200
Merge pull request #347 from sebastian-nagel/NUTCH-2549-protocol-http-fixes
NUTCH-2549 protocol-http does not behave the same as browsers
conf/nutch-default.xml | 6 +-
.../org/apache/nutch/metadata/HttpHeaders.java | 2 +
.../nutch/metadata/SpellCheckedMetadata.java | 7 +-
.../apache/nutch/protocol/http/HttpResponse.java | 161 +++++++----
.../src/test/conf/nutch-site-test.xml | 8 +-
.../protocol/http/TestBadServerResponses.java | 313 +++++++++++++++++++++
.../urlnormalizer/basic/BasicURLNormalizer.java | 20 +-
.../basic/TestBasicURLNormalizer.java | 2 +
8 files changed, 452 insertions(+), 67 deletions(-)
--
To stop receiving notification emails like this one, please contact
snagel@apache.org.