You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/02/16 11:11:12 UTC

[jira] [Updated] (NUTCH-1921) Optionally disable HTTP if-modified-since header

     [ https://issues.apache.org/jira/browse/NUTCH-1921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1921:
---------------------------------
    Fix Version/s:     (was: 1.11)
                   1.10
          Summary: Optionally disable HTTP if-modified-since header  (was: Optionally parse fetch_not_modified)

> Optionally disable HTTP if-modified-since header
> ------------------------------------------------
>
>                 Key: NUTCH-1921
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1921
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 1.9
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.10
>
>         Attachments: NUTCH-1921-trunk.patch
>
>
> Records with fetch_not_modified are not parsed and are not passed through parse filters, index filters and are not being indexed. This is a huge problem if you modified parser filter, indexing filter or whatever behaviour in the pipe line because changes never show up in the index.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)