Posted to dev@nutch.apache.org by "Dietrich Schmidt (JIRA)" <ji...@apache.org> on 2011/04/19 20:35:06 UTC

[jira] [Updated] (NUTCH-985) Problems indexing lastModifiedDate in Solr

     [ https://issues.apache.org/jira/browse/NUTCH-985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dietrich Schmidt updated NUTCH-985:
-----------------------------------

    Attachment: indexlastmodifieddate.jar

This custom plugin adds a field named "lastModifiedDateFormat" to the Nutch document. The field's value is in the date format expected by Solr.
Please make sure to add the plugin to your Nutch configuration file so that it is loaded.
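
For readers without the attachment, the conversion such a field needs can be sketched as follows. This is a minimal illustration and not the code in indexlastmodifieddate.jar: the class name SolrDateSketch and the method toSolrDate are hypothetical, and it assumes that lastModified is a count of milliseconds since the Unix epoch and that the target Solr field is a date field accepting ISO 8601 timestamps in UTC.

```java
import java.time.Instant;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;

// Hypothetical helper, not part of Nutch or of the attached plugin.
final class SolrDateSketch {

    // Solr date fields take ISO 8601 timestamps in UTC with a trailing 'Z'.
    private static final DateTimeFormatter SOLR_DATE =
            DateTimeFormatter.ofPattern("yyyy-MM-dd'T'HH:mm:ss'Z'")
                    .withZone(ZoneOffset.UTC);

    // Assumption: the input is epoch milliseconds, as in the posted "changed" value.
    static String toSolrDate(long epochMillis) {
        return SOLR_DATE.format(Instant.ofEpochMilli(epochMillis));
    }

    public static void main(String[] args) {
        // The value from the rejected document in this issue.
        System.out.println(toSolrDate(1079326800000L)); // prints 2004-03-15T05:00:00Z
    }
}
```

With a field in this format, the entry in solrindex-mapping.xml would presumably use "lastModifiedDateFormat" rather than "lastModified" as its source.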

> Problems indexing lastModifiedDate in Solr
> ------------------------------------------
>
>                 Key: NUTCH-985
>                 URL: https://issues.apache.org/jira/browse/NUTCH-985
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>            Reporter: Dietrich Schmidt
>         Attachments: indexlastmodifieddate.jar
>
>
> I am using the index-more plugin to parse the lastModified data in web
> pages in order to store it in a Solr data field.
> In solrindex-mapping.xml I am mapping lastModified to a field "changed" in Solr:
>                 <field dest="changed" source="lastModified"/>
> However, when posting data to Solr the SolrIndexer posts it as a long,
> not as a date:
> <add><doc boost="1.0">
> <field name="changed">1079326800000</field>
> <field name="tstamp">20110414144140188</field>
> <field name="date">20040315</field>
> Solr rejects the data because of the improper data type.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira