Posted to commits@nutch.apache.org by sn...@apache.org on 2018/06/02 11:35:17 UTC

[nutch] 01/01: Merge pull request #336 from sebastian-nagel/NUTCH-2583-upgrade-dependencies

This is an automated email from the ASF dual-hosted git repository.

snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git

commit 2544fad223faeaafef966d0f04ff00da9f749641
Merge: 0cec7b5 107b364
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Sat Jun 2 13:35:11 2018 +0200

    Merge pull request #336 from sebastian-nagel/NUTCH-2583-upgrade-dependencies
    
    NUTCH-2583 Upgrading Nutch's dependencies (contributed by Ralf)
    NUTCH-2584 Upgrade parse-tika to use Tika 1.18
    NUTCH-2589 HTML redirections are not followed when using parse-tika

 ivy/ivy.xml                                        |  67 +++++------
 src/plugin/build-plugin.xml                        |   4 +
 src/plugin/parse-tika/build.xml                    |  15 +--
 src/plugin/parse-tika/howto_upgrade_tika.txt       |  16 ++-
 src/plugin/parse-tika/ivy.xml                      |   2 +-
 src/plugin/parse-tika/plugin.xml                   |  65 +++++++----
 .../apache/nutch/parse/tika/HTMLMetaProcessor.java | 125 +++++++++++++--------
 .../org/apache/nutch/parse/tika/TikaParser.java    |  20 ++--
 .../{ => parse}/tika/TestDOMContentUtils.java      |  78 +++++++------
 .../nutch/{ => parse}/tika/TestFeedParser.java     |   2 +-
 .../nutch/{ => parse}/tika/TestHtmlParser.java     |   2 +-
 .../nutch/{ => parse}/tika/TestImageMetadata.java  |   2 +-
 .../nutch/{ => parse}/tika/TestMSWordParser.java   |   2 +-
 .../nutch/{ => parse}/tika/TestOOParser.java       |   2 +-
 .../nutch/{ => parse}/tika/TestPdfParser.java      |   2 +-
 .../nutch/{ => parse}/tika/TestRTFParser.java      |   2 +-
 .../{ => parse}/tika/TestRobotsMetaProcessor.java  |  70 ++++++++----
 17 files changed, 276 insertions(+), 200 deletions(-)

-- 
To stop receiving notification emails like this one, please contact
snagel@apache.org.