You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/09 08:53:22 UTC

[jira] [Resolved] (NUTCH-62) Add html META tag information into metaData in index-more plugin

     [ https://issues.apache.org/jira/browse/NUTCH-62?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche resolved NUTCH-62.
--------------------------------

    Resolution: Implemented

This can be done in a more flexible way using index-metadata
https://issues.apache.org/jira/browse/NUTCH-1264
                
> Add html META tag information into metaData in index-more plugin
> ----------------------------------------------------------------
>
>                 Key: NUTCH-62
>                 URL: https://issues.apache.org/jira/browse/NUTCH-62
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Jack Tang
>            Priority: Trivial
>         Attachments: index-more.patch.zip
>
>
> Now(version dev-0.7), only some metaData  in http response such as type, date, content-length are available int the index-more plugin. And we cannot index/sotre the meta data in html header (<META> exactly)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira