You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2023/02/17 16:27:00 UTC
[jira] [Commented] (NUTCH-2983) nutch-default.xml improvements
[ https://issues.apache.org/jira/browse/NUTCH-2983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17690477#comment-17690477 ]
ASF GitHub Bot commented on NUTCH-2983:
---------------------------------------
sebastian-nagel opened a new pull request, #756:
URL: https://github.com/apache/nutch/pull/756
- remove property `hadoop.job.history.user.location`, obsolete since Hadoop 0.21.0
- normalize spelling (case) of URL and CrawlDb
- trim trailing space
- fix typos
- improve description of properties `{db,linkdb}.ignore.{ex,in}ternal.links`
> nutch-default.xml improvements
> ------------------------------
>
> Key: NUTCH-2983
> URL: https://issues.apache.org/jira/browse/NUTCH-2983
> Project: Nutch
> Issue Type: Improvement
> Components: configuration, documentation
> Affects Versions: 1.20
> Reporter: Sebastian Nagel
> Priority: Major
> Fix For: 1.20
>
>
> This issue covers a couple of improvements in the nutch-default.xml
> - removal of obsolete properties
> - complete description of properties related to following internal/external links in CrawlDb and LinkDb
> - typos and formatting
--
This message was sent by Atlassian Jira
(v8.20.10#820010)