You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/02/16 11:11:12 UTC
[jira] [Updated] (NUTCH-1921) Optionally disable HTTP
if-modified-since header
[ https://issues.apache.org/jira/browse/NUTCH-1921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Markus Jelsma updated NUTCH-1921:
---------------------------------
Fix Version/s: (was: 1.11)
1.10
Summary: Optionally disable HTTP if-modified-since header (was: Optionally parse fetch_not_modified)
> Optionally disable HTTP if-modified-since header
> ------------------------------------------------
>
> Key: NUTCH-1921
> URL: https://issues.apache.org/jira/browse/NUTCH-1921
> Project: Nutch
> Issue Type: Bug
> Components: fetcher
> Affects Versions: 1.9
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Fix For: 1.10
>
> Attachments: NUTCH-1921-trunk.patch
>
>
> Records with fetch_not_modified are not parsed and are not passed through parse filters, index filters and are not being indexed. This is a huge problem if you modified parser filter, indexing filter or whatever behaviour in the pipe line because changes never show up in the index.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)