Posted to dev@nutch.apache.org by "kiran (JIRA)" <ji...@apache.org> on 2013/05/14 01:33:17 UTC

[jira] [Updated] (NUTCH-1561) improve usability of parse-metatags and index-metadata

     [ https://issues.apache.org/jira/browse/NUTCH-1561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

kiran updated NUTCH-1561:
-------------------------

    Attachment: NUTCH-1561-v1.patch

The patch fixes the two issues raised above:

i) The property "metatags.names" now uses ',' as the separator.
ii) Meta tags in "index.parse.md" can be written in uppercase or lowercase when defining fields.
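
For illustration, with the patch applied the example configuration from the issue description could presumably be written as below. This is only a sketch based on the two points above, not copied from the patch; the mixed-case field names in "index.parse.md" are an assumption about how the case-insensitive handling would be used.

{code}
<!-- assumed post-patch form: ',' as separator, same as index.parse.md -->
<property>
  <name>metatags.names</name>
  <value>DC.creator,DCTERMS.bibliographicCitation</value>
</property>
<!-- assumed: original case may be kept, lowercase still works -->
<property>
  <name>index.parse.md</name>
  <value>metatag.DC.creator,metatag.DCTERMS.bibliographicCitation</value>
</property>
{code}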
                
> improve usability of parse-metatags and index-metadata
> ------------------------------------------------------
>
>                 Key: NUTCH-1561
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1561
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.6
>            Reporter: Sebastian Nagel
>            Priority: Minor
>             Fix For: 1.8
>
>         Attachments: NUTCH-1561-v1.patch
>
>
> Usually, the plugins parse-metatags and index-metadata are used in combination: the former extracts meta tags, the latter adds the extracted tags as fields to the index.
> The configuration of the two plugins differs, which causes pitfalls and reduces usability (see the example config):
> * the property "metatags.names" of parse-metatags uses ';' as the separator instead of the ',' used by index-metadata
> * meta tag names have to be lowercased in the index-metadata property "index.parse.md"
> {code}
> <property>
>   <name>metatags.names</name>
>   <value>DC.creator;DCTERMS.bibliographicCitation</value>
> </property>
> <property>
>   <name>index.parse.md</name>
>   <value>metatag.dc.creator,metatag.dcterms.bibliographiccitation</value>
> </property>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira