Posted to commits@nutch.apache.org by sn...@apache.org on 2017/12/17 11:33:33 UTC

[nutch] 01/01: Merge pull request #263 from sebastian-nagel/nutch-2478-parser-resolve-base-url

This is an automated email from the ASF dual-hosted git repository.

snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git

commit d73f2930cbb57ad1a37667cecd73f38daca7a1cd
Merge: 45ce310 2aec79f
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Sun Dec 17 12:33:30 2017 +0100

    Merge pull request #263 from sebastian-nagel/nutch-2478-parser-resolve-base-url
    
    NUTCH-2478 parser to resolve base href URL based on fetched URL

 src/java/org/apache/nutch/util/DomUtil.java        |  9 ++++++
 .../apache/nutch/parse/html/DOMContentUtils.java   |  7 ++---
 .../org/apache/nutch/parse/html/HtmlParser.java    | 12 ++++++--
 .../apache/nutch/parse/html/TestHtmlParser.java    | 26 +++++++++++++++++-
 .../apache/nutch/parse/tika/DOMContentUtils.java   |  7 ++---
 .../org/apache/nutch/parse/tika/TikaParser.java    | 15 ++++++++--
 .../org/apache/nutch/tika}/TestHtmlParser.java     | 32 +++++++++++++++++++---
 7 files changed, 88 insertions(+), 20 deletions(-)
