You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by sn...@apache.org on 2017/12/17 11:33:33 UTC
[nutch] 01/01: Merge pull request #263 from
sebastian-nagel/nutch-2478-parser-resolve-base-url
This is an automated email from the ASF dual-hosted git repository.
snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git
commit d73f2930cbb57ad1a37667cecd73f38daca7a1cd
Merge: 45ce310 2aec79f
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Sun Dec 17 12:33:30 2017 +0100
Merge pull request #263 from sebastian-nagel/nutch-2478-parser-resolve-base-url
NUTCH-2478 parser to resolve base hfref URL based on fetched URL
src/java/org/apache/nutch/util/DomUtil.java | 9 ++++++
.../apache/nutch/parse/html/DOMContentUtils.java | 7 ++---
.../org/apache/nutch/parse/html/HtmlParser.java | 12 ++++++--
.../apache/nutch/parse/html/TestHtmlParser.java | 26 +++++++++++++++++-
.../apache/nutch/parse/tika/DOMContentUtils.java | 7 ++---
.../org/apache/nutch/parse/tika/TikaParser.java | 15 ++++++++--
.../org/apache/nutch/tika}/TestHtmlParser.java | 32 +++++++++++++++++++---
7 files changed, 88 insertions(+), 20 deletions(-)
--
To stop receiving notification emails like this one, please contact
"commits@nutch.apache.org" <co...@nutch.apache.org>.