You are viewing a plain text version of this content. The canonical link for it is here.
- [nutch] branch master updated: NUTCH-2718: file names of configuration files of index writers and exchanges are configurable. - posted by r0...@apache.org on 2019/09/01 04:50:04 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2719: Showing a warning when an exchange points to an indexer that doesn't exist. - posted by r0...@apache.org on 2019/09/01 04:50:55 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2669 Reliable solution for javax.ws packaging.type - update org.apache.cxf to drop javax.ws.rs-api dependency - posted by sn...@apache.org on 2019/09/01 06:52:58 UTC, 0 replies.
- [nutch] branch master updated (fa9f895 -> 9a9f425) - posted by sn...@apache.org on 2019/09/01 08:47:04 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2598 URLNormalizerChecker fails on invalid URLs in input - output empty string for invalid URLs (MalformdURLException thrown) or if normalizer(s) return null - posted by sn...@apache.org on 2019/09/01 10:05:51 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2696 Nutch SegmentReader does not dump non-ASCII characters with Hadoop 3.x - open streams using fixed UTF-8 encoding - posted by sn...@apache.org on 2019/09/01 15:48:44 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2732: .template for configuration files. - posted by r0...@apache.org on 2019/09/09 12:57:30 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2612 Support for sitemap processing by hostname - posted by ma...@apache.org on 2019/09/09 13:01:16 UTC, 0 replies.
- [nutch] branch master updated (87b08fc -> 9e5ae73) - posted by r0...@apache.org on 2019/09/19 13:10:55 UTC, 0 replies.
- [nutch] branch 2.x updated: NUTCH-2734 Upgrade Tika dependency to 1.22 - fall back to default Tika config if custom config file is not found - warn if loading a parser fails (reports potential plugin class loader issues - improve ant build file to download plugin dependencies (src/plugin/parse-tika/build-ivy.xml) - complete exclusions of dependencies provided also in Nutch core - force same version of xml-apis to be used by tika-core and tika-parsers: otherwise Tika parsers may fail with a linkage error because d [...] - posted by sn...@apache.org on 2019/09/20 16:07:12 UTC, 0 replies.
- [nutch] branch branch-2.4 updated: NUTCH-2734 Upgrade Tika dependency to 1.22 - fall back to default Tika config if custom config file is not found - warn if loading a parser fails (reports potential plugin class loader issues - improve ant build file to download plugin dependencies (src/plugin/parse-tika/build-ivy.xml) - complete exclusions of dependencies provided also in Nutch core - force same version of xml-apis to be used by tika-core and tika-parsers: otherwise Tika parsers may fail with a linkage error because d [...] - posted by sn...@apache.org on 2019/09/23 08:32:54 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-1982 Make Git ignore IDE project files and add note about IDE setup - apply and update patch contributed by Marko Asplund - posted by sn...@apache.org on 2019/09/23 08:44:53 UTC, 0 replies.
- [nutch] branch branch-2.4 updated (52188de -> 7550c5a) - posted by sn...@apache.org on 2019/09/23 13:52:59 UTC, 0 replies.
- [nutch] 01/03: Add NUTCH-2722 and NUTCH-2734 to CHANGES.txt - posted by sn...@apache.org on 2019/09/23 13:53:00 UTC, 0 replies.
- [nutch] 02/03: Add dependency to hbase-common explicitly to ivy.xml to avoid errors during runtime - posted by sn...@apache.org on 2019/09/23 13:53:01 UTC, 0 replies.
- [nutch] 03/03: Update 2.4 release date - posted by sn...@apache.org on 2019/09/23 13:53:02 UTC, 0 replies.
- [nutch] branch branch-2.4 updated (7550c5a -> e803acd) - posted by sn...@apache.org on 2019/09/23 13:57:56 UTC, 0 replies.
- [nutch] 01/01: Update 2.4 release date - posted by sn...@apache.org on 2019/09/23 13:57:57 UTC, 0 replies.
- [nutch] annotated tag release-2.4 updated (e803acd -> af4932c) - posted by sn...@apache.org on 2019/09/23 15:53:05 UTC, 0 replies.
- svn commit: r35983 [1/3] - in /dev/nutch/2.4: ./ CHANGES.txt apache-nutch-2.4-src.tar.gz apache-nutch-2.4-src.tar.gz.asc apache-nutch-2.4-src.tar.gz.sha512 apache-nutch-2.4-src.zip apache-nutch-2.4-src.zip.asc apache-nutch-2.4-src.zip.sha512 - posted by sn...@apache.org on 2019/09/24 09:15:35 UTC, 0 replies.
- svn commit: r35983 [3/3] - in /dev/nutch/2.4: ./ CHANGES.txt apache-nutch-2.4-src.tar.gz apache-nutch-2.4-src.tar.gz.asc apache-nutch-2.4-src.tar.gz.sha512 apache-nutch-2.4-src.zip apache-nutch-2.4-src.zip.asc apache-nutch-2.4-src.zip.sha512 - posted by sn...@apache.org on 2019/09/24 09:15:35 UTC, 0 replies.
- svn commit: r35983 [2/3] - in /dev/nutch/2.4: ./ CHANGES.txt apache-nutch-2.4-src.tar.gz apache-nutch-2.4-src.tar.gz.asc apache-nutch-2.4-src.tar.gz.sha512 apache-nutch-2.4-src.zip apache-nutch-2.4-src.zip.asc apache-nutch-2.4-src.zip.sha512 - posted by sn...@apache.org on 2019/09/24 09:15:35 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2736 Upgrade Dockerfile to be based on recent Ubuntu LTS version - upgrade to use Ubuntu 18.04 - remove installed packages not required to build Nutch - posted by sn...@apache.org on 2019/09/27 09:18:14 UTC, 0 replies.
- [nutch] branch master updated (caa9422 -> ff9f025) - posted by sn...@apache.org on 2019/09/30 07:15:08 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2482 index-geoip not to add null values to document fields - also improve handling of errors when searching for and reading GeoIP database files - upgrade dependencies - posted by sn...@apache.org on 2019/09/30 07:23:45 UTC, 0 replies.
- [nutch] branch master updated (0f46927 -> 9e49c3f) - posted by sn...@apache.org on 2019/09/30 11:30:44 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2381 In some situations the class TextProfileSignature gives different signatures for the same text "profile" page. - implement secondary sorting (similar to patch provided by Rodrigo Joni Sestari) - allow to restore previous behavior by setting property `db.signature.text_profile.sec_sort_lex = false` - posted by sn...@apache.org on 2019/09/30 11:31:52 UTC, 0 replies.