You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lenya.apache.org by bu...@apache.org on 2009/02/12 16:04:05 UTC

DO NOT REPLY [Bug 46706] New: Hexadecimal and decimal represenations of characters are converted to the utf-8 character by the oneform editor.

https://issues.apache.org/bugzilla/show_bug.cgi?id=46706

           Summary: Hexadecimal and decimal represenations of characters are
                    converted to the utf-8 character by the oneform editor.
           Product: Lenya
           Version: 2.0.2
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Form Editor
        AssignedTo: dev@lenya.apache.org
        ReportedBy: lambert.utz@uniklinik-freiburg.de


Hexadecimal and decimal represenations of characters are converted to the utf-8
character by the oneform editor.
A   is transformed to a blank. If there are serveral   they are
converted to blanks and they are shown as one blank. So it's impossible to have
serveral leading blanks as in lenya 1.2.x


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org


DO NOT REPLY [Bug 46706] Hexadecimal and decimal represenations of characters are converted to the utf-8 character by the oneform editor.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46706





--- Comment #4 from Andreas Hartmann <an...@apache.org>  2009-02-12 08:04:21 PST ---
Using SAX instead would be virtually impossible because we use XPath to specify
the link attributes, which doesn't sound like a good idea now :(

Maybe we should allow a different, more SAX-compatible way to specifiy the link
attributes, similar to the configuration of the LinkRewritingTransformer
classes:

<link-attribute namespace="http://www.w3.org/1999/xhtml" element="a"
attribute="href"/>


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org


DO NOT REPLY [Bug 46706] Hexadecimal and decimal represenations of characters are converted to the utf-8 character by the oneform editor.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46706





--- Comment #1 from Andreas Hartmann <an...@apache.org>  2009-02-12 07:46:16 PST ---
The change happens when the document is written. If the document already
contains a &#160; it is displayed correctly in the editor.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org


DO NOT REPLY [Bug 46706] Hexadecimal and decimal represenations of characters are converted to the utf-8 character by the oneform editor.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46706





--- Comment #2 from Andreas Hartmann <an...@apache.org>  2009-02-12 07:49:53 PST ---
The content parameter contains the &#160; entity when the usecase handler is
called.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org


DO NOT REPLY [Bug 46706] Hexadecimal and decimal represenations of characters are converted to the utf-8 character by the oneform editor.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46706





--- Comment #5 from Andreas Hartmann <an...@apache.org>  2009-02-12 08:07:37 PST ---
(In reply to comment #4)

> Maybe we should allow a different, more SAX-compatible way to specifiy the link
> attributes, similar to the configuration of the LinkRewritingTransformer
> classes:

BTW, in Lenya 2.2 I introduced two classes for this purpose:

* LinkRewriteAttributes
* LinkRewriteAttribute

A LinkRewriteAttributes object (singleton) is basically the declaration of the
link attributes of a document type. I think thas can be generalized.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org


DO NOT REPLY [Bug 46706] Hexadecimal and decimal represenations of characters are converted to the utf-8 character by the oneform editor.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46706





--- Comment #3 from Andreas Hartmann <an...@apache.org>  2009-02-12 07:51:44 PST ---
I guess the problem is the DOM conversion. Unfortunately it is necessary for
link rewriting.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org


DO NOT REPLY [Bug 46706] Hexadecimal and decimal represenations of characters are converted to the utf-8 character by the oneform editor.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=46706





--- Comment #6 from J <ne...@apache.org>  2009-02-16 13:12:10 PST ---
what part of the rewriting behaviour is the problem? imho, an entity should be
equivalent to the corresponding utf-8 character, and any transformation between
those two is a valid identity operation in the semantic space (if not the
lexical one).
i wonder what lambert is trying to accomplish here?

would it help to optionally convert everything that is not 7bit-ascii into
numerical entities?


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org