You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/31 23:33:14 UTC

[jira] Updated: (NUTCH-65) index-more plugin can't parse large set of modification-date

     [ http://issues.apache.org/jira/browse/NUTCH-65?page=all ]

Jerome Charron updated NUTCH-65:
--------------------------------

    Version: 0.7
             0.8-dev

> index-more plugin can't parse large set of  modification-date
> -------------------------------------------------------------
>
>          Key: NUTCH-65
>          URL: http://issues.apache.org/jira/browse/NUTCH-65
>      Project: Nutch
>         Type: Bug
>   Components: indexer
>     Versions: 0.7, 0.8-dev
>  Environment: nutch 0.7, java 1.5, linux
>     Reporter: Lutischán Ferenc

>
> I found a problem in MoreIndexingFilter.java.
> When I indexing segments, I get large list of error messages:
> can't parse errorenous date: Wed, 10 Sep 2003 11:59:14 or
> can't parse errorenous date: Wed, 10 Sep 2003 11:59:14GMT
> I modifiing source code (I don't make a 'patch'):
> Original (lines 137-138):
> DateFormat df = new SimpleDateFormat("EEE MMM dd HH:mm:ss yyyy zzz");
> Date d = df.parse(date);
> New:
> DateFormat df = new SimpleDateFormat("EEE, MMM dd HH:mm:ss yyyy", Locale.US);
> Date d = df.parse(date.substring(0,25));
> The modified code works fine.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira