Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/05/01 18:21:16 UTC
[jira] [Commented] (NUTCH-1657) ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser
[ https://issues.apache.org/jira/browse/NUTCH-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13986726#comment-13986726 ]
Julien Nioche commented on NUTCH-1657:
--------------------------------------
+1 thanks!
> ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser
> -------------------------------------------------------------------------------
>
> Key: NUTCH-1657
> URL: https://issues.apache.org/jira/browse/NUTCH-1657
> Project: Nutch
> Issue Type: Bug
> Affects Versions: 2.2.1
> Reporter: Talat UYARER
> Priority: Minor
> Fix For: 2.3
>
> Attachments: NUTCH-1657.patch
>
>
> ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION are never set in HTMLParser.java.
> In 2.x, these values are not stored in any field. Since they are never used in 2.x, I first thought of deleting them, but following Feng Lu's guidance I will set them as metadata fields instead.