Posted to dev@nutch.apache.org by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/08 23:48:16 UTC

[jira] [Reopened] (NUTCH-1554) org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware

     [ https://issues.apache.org/jira/browse/NUTCH-1554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Nagel reopened NUTCH-1554:
------------------------------------


Hi Lewis, the opposite is true: HttpDateFormat is now sensitive to the locale set on the system. If a Russian locale is used, the If-Modified-Since date sent in the HTTP header will look like:
{code}
% LC_ALL=ru_RU.utf8 runtime/local/bin/nutch \
    org.apache.nutch.net.protocols.HttpDateFormat
Пн, 08 апр 2013 21:24:24 GMT
{code}
That's definitely not the date format specified in the HTTP RFC (http://www.w3.org/Protocols/rfc2616/rfc2616-sec3.html#sec3.3.1).

See also: http://blog.thetaphi.de/2012/07/default-locales-default-charsets-and.html
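
To illustrate the point, here is a minimal standalone sketch, not the actual Nutch code: the class name HttpDateSketch and the method toHttpDate are hypothetical, and the pattern is my assumption of an RFC 1123 style date. It formats the same instant once with Locale.US pinned and once with a Russian locale. Only the first form uses the English day and month names the RFC requires; the second depends on the locale data installed with the JVM.

```java
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

// Hypothetical sketch, not part of Nutch.
public class HttpDateSketch {

    // Assumed RFC 1123 style pattern; 'GMT' is a quoted literal so the
    // zone text never depends on locale data.
    private static final String PATTERN = "EEE, dd MMM yyyy HH:mm:ss 'GMT'";

    // Formats the date in GMT using day and month names of the given locale.
    static String toHttpDate(Date date, Locale locale) {
        // SimpleDateFormat is not thread-safe, so create one per call here.
        SimpleDateFormat format = new SimpleDateFormat(PATTERN, locale);
        format.setTimeZone(TimeZone.getTimeZone("GMT"));
        return format.format(date);
    }

    public static void main(String[] args) {
        Date now = new Date();
        // Pinned to Locale.US: English names regardless of LC_ALL.
        System.out.println(toHttpDate(now, Locale.US));
        // Russian locale: localized names, not valid as an HTTP date.
        System.out.println(toHttpDate(now, Locale.forLanguageTag("ru-RU")));
    }
}
```

Passing the locale explicitly keeps the output independent of LC_ALL, which is the behaviour the HTTP date format needs.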

                
> org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware
> ---------------------------------------------------------------------------
>
>                 Key: NUTCH-1554
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1554
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.6, 2.1
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.7, 2.2
>
>         Attachments: NUTCH-1554-2.x.patch, NUTCH-1554-trunk.patch
>
>
> I assume this is legacy code.
> Currently the above class is Locale specific and really should not be. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira