You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/10/27 12:41:19 UTC

[jira] Updated: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

     [ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-900:
--------------------------------

    Attachment: NUTCH-900-1.3.patch

This patch is for branch-1.3 and fixes a typo in http.content.limit

> Confusion in nutch-default between http.content.limit and file.content.limit
> ----------------------------------------------------------------------------
>
>                 Key: NUTCH-900
>                 URL: https://issues.apache.org/jira/browse/NUTCH-900
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.2, 2.0
>            Reporter: Markus Jelsma
>            Assignee: Julien Nioche
>            Priority: Trivial
>             Fix For: 1.2, 2.0
>
>         Attachments: NUTCH-900-1.3.patch, NUTCH-900.MarkusJelsma.100908.patch.txt
>
>
> The http.content.limit and file.content.limit settings can be confusing and have fooled at least several users. The description element for these settings should be changed to reflect the difference between them so users won't be fooled that easy.
> See also: http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html for a discussion.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.