- [nutch] branch master updated (6ca3c5b -> 65361d0) - posted by sn...@apache.org on 2019/11/07 08:01:50 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-1559 parse-metatags duplicates extracted metatags - do not add metatags already in ParseData's as this may lead to duplicates - add unit test - fix logging in MetaTagsParser to use slf4j - posted by sn...@apache.org on 2019/11/07 08:23:56 UTC, 0 replies.
- [nutch] branch master updated: Fix for NUTCH-2750 - posted by sn...@apache.org on 2019/11/12 14:29:45 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2739: Upgrade ES and migrate to REST client - posted by sn...@apache.org on 2019/11/22 13:23:43 UTC, 0 replies.
- [nutch] branch master updated: NUTCH-2746 Basic URL normalizer to normalize Unicode domain names - convert domain names IDN to ASCII or ASCII to IDN (configured by urlnormalizer.basic.host.idn) - strip trailing dot in host names (if urlnormalizer.basic.host.trim-trailing-dot is true) - posted by sn...@apache.org on 2019/11/22 17:52:26 UTC, 0 replies.