You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2023/02/17 16:27:00 UTC

[jira] [Commented] (NUTCH-2983) nutch-default.xml improvements

    [ https://issues.apache.org/jira/browse/NUTCH-2983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17690477#comment-17690477 ] 

ASF GitHub Bot commented on NUTCH-2983:
---------------------------------------

sebastian-nagel opened a new pull request, #756:
URL: https://github.com/apache/nutch/pull/756

   - remove property `hadoop.job.history.user.location`, obsolete since Hadoop 0.21.0
   - normalize spelling (case) of URL and CrawlDb
   - trim trailing space
   - fix typos
   - improve description of properties `{db,linkdb}.ignore.{ex,in}ternal.links`




> nutch-default.xml improvements
> ------------------------------
>
>                 Key: NUTCH-2983
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2983
>             Project: Nutch
>          Issue Type: Improvement
>          Components: configuration, documentation
>    Affects Versions: 1.20
>            Reporter: Sebastian Nagel
>            Priority: Major
>             Fix For: 1.20
>
>
> This issue covers a couple of improvements in the nutch-default.xml
> - removal of obsolete properties
> - complete description of properties related to following internal/external links in CrawlDb and LinkDb
> - typos and formatting



--
This message was sent by Atlassian Jira
(v8.20.10#820010)