You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Ilguiz Latypov (JIRA)" <ji...@apache.org> on 2009/04/30 21:54:30 UTC

[jira] Created: (NUTCH-733) plain text view of cached files ignores HTML encoding

plain text view of cached files ignores HTML encoding
-----------------------------------------------------

                 Key: NUTCH-733
                 URL: https://issues.apache.org/jira/browse/NUTCH-733
             Project: Nutch
          Issue Type: Improvement
          Components: web gui
    Affects Versions: 1.0.0
            Reporter: Ilguiz Latypov


The plain text view of cached HitDetails is sent as raw text under the Content-Type: text/html header.

Either the content type should be changed to text/plain (patch attached) or the text should be HTML-encoded (perhaps, using http://commons.apache.org/lang/api/org/apache/commons/lang/StringEscapeUtils.html).


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-733) plain text view of cached files ignores HTML encoding

Posted by "Doğacan Güney (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717056#action_12717056 ] 

Doğacan Güney commented on NUTCH-733:
-------------------------------------

I think I prefer HTML-encoding but this also looks good to me. So I will commit it if there are no objections.

> plain text view of cached files ignores HTML encoding
> -----------------------------------------------------
>
>                 Key: NUTCH-733
>                 URL: https://issues.apache.org/jira/browse/NUTCH-733
>             Project: Nutch
>          Issue Type: Improvement
>          Components: web gui
>    Affects Versions: 1.0.0
>            Reporter: Ilguiz Latypov
>         Attachments: text.jsp-plain-text.diff.txt
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> The plain text view of cached HitDetails is sent as raw text under the Content-Type: text/html header.
> Either the content type should be changed to text/plain (patch attached) or the text should be HTML-encoded (perhaps, using http://commons.apache.org/lang/api/org/apache/commons/lang/StringEscapeUtils.html).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-733) plain text view of cached files ignores HTML encoding

Posted by "Ilguiz Latypov (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12705819#action_12705819 ] 

Ilguiz Latypov commented on NUTCH-733:
--------------------------------------

My patch does not wrap long lines.

> plain text view of cached files ignores HTML encoding
> -----------------------------------------------------
>
>                 Key: NUTCH-733
>                 URL: https://issues.apache.org/jira/browse/NUTCH-733
>             Project: Nutch
>          Issue Type: Improvement
>          Components: web gui
>    Affects Versions: 1.0.0
>            Reporter: Ilguiz Latypov
>         Attachments: text.jsp-plain-text.diff.txt
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> The plain text view of cached HitDetails is sent as raw text under the Content-Type: text/html header.
> Either the content type should be changed to text/plain (patch attached) or the text should be HTML-encoded (perhaps, using http://commons.apache.org/lang/api/org/apache/commons/lang/StringEscapeUtils.html).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-733) plain text view of cached files ignores HTML encoding

Posted by "Ilguiz Latypov (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ilguiz Latypov updated NUTCH-733:
---------------------------------

    Attachment: text.jsp-plain-text.diff.txt

Set the Content-Type: "text/plain; charset=UTF-8" header.  Ignore HTML-rich i18n message taglib for now.  Perhaps, there is a plain-text version of the "note" tag.  I do not know.



> plain text view of cached files ignores HTML encoding
> -----------------------------------------------------
>
>                 Key: NUTCH-733
>                 URL: https://issues.apache.org/jira/browse/NUTCH-733
>             Project: Nutch
>          Issue Type: Improvement
>          Components: web gui
>    Affects Versions: 1.0.0
>            Reporter: Ilguiz Latypov
>         Attachments: text.jsp-plain-text.diff.txt
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> The plain text view of cached HitDetails is sent as raw text under the Content-Type: text/html header.
> Either the content type should be changed to text/plain (patch attached) or the text should be HTML-encoded (perhaps, using http://commons.apache.org/lang/api/org/apache/commons/lang/StringEscapeUtils.html).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.