You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/06/27 12:34:47 UTC

[jira] [Updated] (NUTCH-1015) MoreIndexingFilter: can't parse erroneous date: 2006-05-24T20:03:42

     [ https://issues.apache.org/jira/browse/NUTCH-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1015:
---------------------------------

    Description: 
MoreIndexingFilter must handle the following url's gracefully:

{code}
can't parse erroneous date: Sun, 27 Jun 2010 06:51:35 GMT+1
can't parse erroneous date: ma, 27 jun 2011 05:15:32 GMT
can't parse erroneous date: "Mon, 23 May 2011 22:05:58 GMT"
can't parse erroneous date: GMT
{code}

What to do? Default to now? Fetch time? Anything? 
        Summary: MoreIndexingFilter: can't parse erroneous date: 2006-05-24T20:03:42  (was: can't parse erroneous date: 2006-05-24T20:03:42)

> MoreIndexingFilter: can't parse erroneous date: 2006-05-24T20:03:42
> -------------------------------------------------------------------
>
>                 Key: NUTCH-1015
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1015
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>            Reporter: Markus Jelsma
>             Fix For: 1.4, 2.0
>
>
> MoreIndexingFilter must handle the following url's gracefully:
> {code}
> can't parse erroneous date: Sun, 27 Jun 2010 06:51:35 GMT+1
> can't parse erroneous date: ma, 27 jun 2011 05:15:32 GMT
> can't parse erroneous date: "Mon, 23 May 2011 22:05:58 GMT"
> can't parse erroneous date: GMT
> {code}
> What to do? Default to now? Fetch time? Anything? 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira