You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/08 23:48:16 UTC
[jira] [Reopened] (NUTCH-1554)
org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware
[ https://issues.apache.org/jira/browse/NUTCH-1554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel reopened NUTCH-1554:
------------------------------------
Hi Lewis, the opposite is true: now the HttpDateFormat is sensitive to the locale set on the system. If a Russian locale is used the if-modified-since date sent in the HTTP header will look like:
{code}
% LC_ALL=ru_RU.utf8 runtime/local/bin/nutch \
org.apache.nutch.net.protocols.HttpDateFormat
Пн, 08 апр 2013 21:24:24 GMT
{code}
That's definitely not the date format specified in the HTTP RFC (http://www.w3.org/Protocols/rfc2616/rfc2616-sec3.html#sec3.3.1).
See also: http://blog.thetaphi.de/2012/07/default-locales-default-charsets-and.html
> org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware
> ---------------------------------------------------------------------------
>
> Key: NUTCH-1554
> URL: https://issues.apache.org/jira/browse/NUTCH-1554
> Project: Nutch
> Issue Type: Bug
> Affects Versions: 1.6, 2.1
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Minor
> Fix For: 1.7, 2.2
>
> Attachments: NUTCH-1554-2.x.patch, NUTCH-1554-trunk.patch
>
>
> I assume this is legacy code.
> Currently the above class is Locale specific and really should not be.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira