You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2011/08/22 17:18:29 UTC

[jira] [Commented] (CONNECTORS-243) Web crawler must get the "Last-Modified" HTTP header and pass it as metadata to output

    [ https://issues.apache.org/jira/browse/CONNECTORS-243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13088754#comment-13088754 ] 

Karl Wright commented on CONNECTORS-243:
----------------------------------------

I'll try to have a look at this this evening.


> Web crawler must get the "Last-Modified" HTTP header and pass it as metadata to output
> --------------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-243
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-243
>             Project: ManifoldCF
>          Issue Type: New Feature
>          Components: Web connector
>    Affects Versions: ManifoldCF 0.2
>            Reporter: Jan Høydahl
>              Labels: last-modified
>
> Last-Modified is important in web search, at it may be used for (de)boosting based on date.
> In fact, ManifoldCF should have the ability to parse any (or all) HTTP headers from source document and pass it on.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira