You are viewing a plain text version of this content. The canonical link for it is here.
- [nutch] branch 2.x updated (6adca89 -> add07fa) - posted by sn...@apache.org on 2019/01/04 08:49:03 UTC, 0 replies.
- [nutch] 01/01: Merge pull request #423 from sebastian-nagel/NUTCH-2667-upgrade-tika-commons-collections4 - posted by sn...@apache.org on 2019/01/04 08:49:04 UTC, 0 replies.
- [nutch] branch 2.x updated: NUTCH-2667 Update Tika and Commons Collections 4 - explicitly add dependency to commons-compress 1.18 for tika-core - posted by sn...@apache.org on 2019/01/04 14:43:55 UTC, 0 replies.
- [nutch] branch master updated (43d26ce -> 3ab0227) - posted by sn...@apache.org on 2019/01/06 11:17:31 UTC, 0 replies.
- [nutch] 01/01: Merge pull request #398 from jorgelbg/doc-indexer-links - posted by sn...@apache.org on 2019/01/06 11:17:32 UTC, 0 replies.
- [nutch] branch master updated (3ab0227 -> 58ef2da) - posted by sn...@apache.org on 2019/01/06 11:52:49 UTC, 0 replies.
- [nutch] 01/01: Merge pull request #422 from sebastian-nagel/NUTCH-2657-http-headers-crlf - posted by sn...@apache.org on 2019/01/06 11:52:50 UTC, 0 replies.
- [nutch] branch master updated (58ef2da -> 6274083) - posted by sn...@apache.org on 2019/01/06 19:40:33 UTC, 0 replies.
- [nutch] 01/01: Merge pull request #371 from sebastian-nagel/NUTCH-2628-fetcher-signature - posted by sn...@apache.org on 2019/01/06 19:40:34 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2475 If and else-if branches has the same condition - remove duplicated condition to handle ftp status 451 (requested action aborted) - posted by sn...@apache.org on 2019/01/06 20:13:54 UTC, 0 replies.
- [nutch] branch 2.x updated: NUTCH-2475 If and else-if branches has the same condition - remove duplicated condition to handle ftp status 451 (requested action aborted) - posted by sn...@apache.org on 2019/01/06 20:15:15 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2687 Regex for reading title from Content-Disposition is wrong - posted by ma...@apache.org on 2019/01/18 10:38:08 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2678 Allow for per-host configurable protocol plugin - posted by ma...@apache.org on 2019/01/18 12:27:43 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2663 Improve the JEXL syntax for getting values from the metadata/context - posted by sn...@apache.org on 2019/01/18 15:24:19 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2680 Documentation: https supported by multiple protocol plugins not only httpclient Improve description of property plugin.includes: - https is supported by default - no need to enable the stub plugin nutch-extensionpoints - posted by sn...@apache.org on 2019/01/18 15:26:24 UTC, 0 replies.
- [nutch] branch master updated (0c18f6c -> 86aac2d) - posted by r0...@apache.org on 2019/01/21 13:59:28 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2682 Upgrade to Tika 1.20 - upgrade to Tika dependencies to version 1.20 - plugin parse-tika: add exclusions of transitive dependencies already provided as Nutch core dependencies - upgrade Nutch core dependencies to match versions required by Tika 1.20 - apply code formatting template to TikaParser class and replace deprecated method calls - posted by sn...@apache.org on 2019/01/21 15:36:29 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2685: README.md file for exchange-jexl plugin. - posted by sn...@apache.org on 2019/01/22 16:26:47 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2691: Improve logging from scoring-depth plugin - posted by sn...@apache.org on 2019/01/29 10:16:57 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2689 Speed up urlfilter-regex and urlfilter-automaton - do not extract host and domain name from the URL if not needed - speed up regular expressions: - use non-capturing groups if possible - use (?i) to make the patterns case insensitiven and remove uppercase variants to keep alternations shorter - posted by sn...@apache.org on 2019/01/29 10:31:43 UTC, 0 replies.