Posted to dev@nutch.apache.org by "Antoinette (JIRA)" <ji...@apache.org> on 2013/08/01 21:01:50 UTC

[jira] [Issue Comment Deleted] (NUTCH-1406) index-metadata plugin: conversion to Solr date format

     [ https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Antoinette updated NUTCH-1406:
------------------------------

    Comment: was deleted

(was: Our application of this patch to Nutch 1.6 produces the following: 'ERROR solr.SolrIndexer - java.io.IOException: Job failed!' It works if the 'abstract' declaration is removed, but this then causes IndexingFiltersChecker to fail. Any suggestions or updates?)
    
> index-metadata plugin: conversion to Solr date format
> -----------------------------------------------------
>
>                 Key: NUTCH-1406
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1406
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer, parser
>            Reporter: Kristof 
>            Priority: Minor
>              Labels: conversion, date
>             Fix For: 1.9
>
>         Attachments: index-metadata_formatted.patch
>
>
> This improvement to the index-metadata plugin allows selected fields to be converted to the Solr date format. The main benefit of this conversion is that it makes range facets possible.
> To convert the values of selected metatags to the Solr date format, name them in the index.dateconvert.md property in nutch-site.xml. This can be used, for example, with Dublin Core elements such as dcterms.modified; cic.gc.ca is an example of a site whose pages carry that meta tag. dcterms.modified must also be defined in the metatags.names and index.parse.md properties.
>  
> {code}
> <property>
> 	<name>index.dateconvert.md</name>
> 	<value>metatag.dcterms.modified</value>
> 	<description>For plugin index-metadata: Indicate here the name of the HTML meta tag that should be converted to Solr date format.
> 	</description>
> </property>
> {code}
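
For readers unfamiliar with the target format: Solr date fields expect ISO 8601 timestamps in UTC, such as 2013-08-01T00:00:00Z. The following is only a minimal sketch of that kind of conversion, not the code in the attached patch. The class name SolrDateSketch and the helper toSolrDate are hypothetical, and it assumes the meta tag value is either a plain yyyy-MM-dd date (treated here as midnight UTC) or a full ISO 8601 timestamp with an offset.

```java
import java.time.LocalDate;
import java.time.OffsetDateTime;
import java.time.ZoneOffset;
import java.time.format.DateTimeParseException;

public class SolrDateSketch {

    /**
     * Hypothetical helper, not taken from the NUTCH-1406 patch.
     * Converts a meta tag value to an ISO 8601 UTC timestamp of the
     * form Solr date fields accept.
     */
    public static String toSolrDate(String raw) {
        String value = raw.trim();
        try {
            // Full timestamp with an offset, e.g. 2013-08-01T21:01:50+02:00.
            return OffsetDateTime.parse(value).toInstant().toString();
        } catch (DateTimeParseException e) {
            // Date only, e.g. 2013-08-01. Assumption: treat it as midnight UTC.
            // An unparseable value still throws DateTimeParseException here.
            return LocalDate.parse(value)
                    .atStartOfDay(ZoneOffset.UTC)
                    .toInstant()
                    .toString();
        }
    }

    public static void main(String[] args) {
        System.out.println(toSolrDate("2013-08-01"));                // 2013-08-01T00:00:00Z
        System.out.println(toSolrDate("2013-08-01T21:01:50+02:00")); // 2013-08-01T19:01:50Z
    }
}
```

Values in any other format would need their own parsing rule. Whether to skip such a field or fail the document is a design choice this sketch leaves open.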

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira