You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tomcat.apache.org by bu...@apache.org on 2001/05/21 04:18:55 UTC

[Bug 461] - Use setCharacterEncoding("UTF8") does not change the way getParameterValue*() decodes characters. BugRat Report#785

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=461

*** shadow/461	Sun May 20 17:20:28 2001
--- shadow/461.tmp.23967	Sun May 20 19:18:55 2001
***************
*** 1,19 ****
! Bug#: 461
! Product: SAPI
! Version: 4.0.b1
! Platform: 
! OS/Version: 
! Status: NEW   
! Resolution: 
! Severity: Normal
! Priority: High
! Component: 
! AssignedTo: tomcat-dev@jakarta.apache.org                            
! ReportedBy: tim_tye@usa.net               
! URL: 
! Cc: 
! Summary: Use setCharacterEncoding("UTF8") does not change the way getParameterValue*() decodes characters. BugRat Report#785
! 
  I am testing the support of UNICODE input and output for Java servlets and JSP's.  I have a very simple servlet that creates a web page with a single entry field.  When the ServletResponse is created, I use setContentType("text/html charset=UTF-8") to tell the web browsers this page contains eight bit compressed unicode.  This works, the data is placed from the servlet onto the web page in UTF8 style, and displayed by the web browser (IE and Netscape) correctly.  
  However, when data is entered by the web browser, the getParameterValues method ignores the character encoding and maps each byte to a seperate character.  It fails even when I use the setCharacterEncoding("UTF8") method on the request before reading any parameter names or values.  
! Support of character encoding on request input is critical to any users outside of Latin-1.
--- 1,19 ----
! +============================================================================+
! | Use setCharacterEncoding("UTF8") does not change the way getParameterValue |
! +----------------------------------------------------------------------------+
! |        Bug #: 461                         Product: Tomcat 3                |
! |       Status: UNCONFIRMED                 Version: Unknown                 |
! |   Resolution:                            Platform: All                     |
! |     Severity: Normal                   OS/Version: All                     |
! |     Priority: High                      Component: Unknown                 |
! +----------------------------------------------------------------------------+
! |  Assigned To: Larry.Isaacs@sas.com                                         |
! |  Reported By: tim_tye@usa.net                                              |
! |      CC list: Cc:                                                          |
! +----------------------------------------------------------------------------+
! |          URL:                                                              |
! +============================================================================+
! |                              DESCRIPTION                                   |
  I am testing the support of UNICODE input and output for Java servlets and JSP's.  I have a very simple servlet that creates a web page with a single entry field.  When the ServletResponse is created, I use setContentType("text/html charset=UTF-8") to tell the web browsers this page contains eight bit compressed unicode.  This works, the data is placed from the servlet onto the web page in UTF8 style, and displayed by the web browser (IE and Netscape) correctly.  
  However, when data is entered by the web browser, the getParameterValues method ignores the character encoding and maps each byte to a seperate character.  It fails even when I use the setCharacterEncoding("UTF8") method on the request before reading any parameter names or values.  
! Support of character encoding on request input is critical to any users outside of Latin-1.