Posted to fop-dev@xmlgraphics.apache.org by chz <sa...@gmx.de> on 2004/03/25 19:06:29 UTC

Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

Looking at the FOText class again, the word boundary analysis does not
seem very elegant to me. Perhaps Java's java.text.BreakIterator could do
the same thing?
The BreakIterator originates from the ICU project, and they state that it
even handles Thai correctly, which has no word separator like the space
character and therefore needs a dictionary-based solution.
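
To make the suggestion concrete, here is a minimal sketch of word boundary
analysis with java.text.BreakIterator. It is not taken from FOText; the
class name WordBoundarySketch and the helper words() are made up for
illustration. Whether Thai text is segmented well depends on the locale
data shipped with the Java runtime, which is not verified here.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of FOP.
class WordBoundarySketch {

    // Returns the pieces between word boundaries that start with a letter or digit,
    // i.e. skips the whitespace and punctuation pieces the iterator also reports.
    static List<String> words(String text, Locale locale) {
        BreakIterator it = BreakIterator.getWordInstance(locale);
        it.setText(text);
        List<String> result = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String piece = text.substring(start, end);
            if (Character.isLetterOrDigit(piece.codePointAt(0))) {
                result.add(piece);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(words("Hello  world, again", Locale.ENGLISH));
    }
}
```

The locale is passed in explicitly so that a caller could request, say,
new Locale("th") for Thai text instead of relying on the default locale.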

Regards,
Christian


Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

Posted by John Austin <jw...@sympatico.ca>.
On Thu, 2004-03-25 at 19:08, Glen Mazza wrote:
> I'm confused--is chz (sacl@gmx.de) Christian
> Geisert or another Christian?  The bugzilla entry
> lists chz as being "Christian Z", so I'm not sure whom
> I'm speaking with!  

So we shouldn't all be running around with multiple e-mail
identities?

My excuse is, I used that e-mail address years ago when I opened
my first Bugzilla account.
 

-- 
John Austin <jw...@sympatico.ca>

Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

Posted by "Christian Z." <sa...@gmx.de>.
On Thu, 25.03.2004 at 22:38, Glen Mazza wrote:
> I'm confused--is chz (sacl@gmx.de) Christian
> Geisert or another Christian?  The bugzilla entry
> lists chz as being "Christian Z", so I'm not sure whom
> I'm speaking with!  

Christian Ziesemer
ch        z

I adjusted that.

> If the language has no spaces (Japanese, or Thai I
> guess), then this particular portion, whitespace
> removal, is not relevant, I would guess. I think
> dictionaries are needed for hyphenation, not space
> removal, no?

I don't know the full context in which the FOText class is used. If it's
only used for space removal, then of course there is nothing to do for
Thai, and changing it would be of limited use. :-)

Christian Z.


Re: DO NOT REPLY [Bug 27901] - TextCharIterator.remove() does not work properly

Posted by Glen Mazza <gr...@yahoo.com>.
I'm confused--is chz (sacl@gmx.de) Christian
Geisert or another Christian?  The bugzilla entry
lists chz as being "Christian Z", so I'm not sure whom
I'm speaking with!  

Glen

--- chz <sa...@gmx.de> wrote:
> 
> The BreakIterator originates from the ICU project,
> and they state that it
> even handles Thai correctly, which has no word
> separator like the space
> character and therefore needs a dictionary-based
> solution.
> 

I believe that business logic is already handled in
another class (FOBlock, I think: the one calling the
current iterators), so these iterators are for actual
space removal in the Western sense.  (Of course, what
you mention may need tuning/fixing in FOBlock as
well.)
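
For illustration only, here is a rough sketch of what space removal
through an iterator's remove() can look like. This is not FOP's
TextCharIterator; the class WhitespaceCollapseSketch and its collapse()
method are hypothetical, and it simply assumes that "removal" means
dropping every whitespace character that directly follows another one.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in, not FOP code.
class WhitespaceCollapseSketch {

    // Keeps the first character of each whitespace run and removes the rest.
    static String collapse(String text) {
        List<Character> chars = new ArrayList<>();
        for (char c : text.toCharArray()) {
            chars.add(c);
        }
        boolean previousWasSpace = false;
        for (Iterator<Character> it = chars.iterator(); it.hasNext();) {
            boolean isSpace = Character.isWhitespace(it.next());
            if (isSpace && previousWasSpace) {
                it.remove(); // removes the element just returned by next()
            }
            previousWasSpace = isSpace;
        }
        StringBuilder sb = new StringBuilder();
        for (char c : chars) {
            sb.append(c);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(collapse("a   b  c"));
    }
}
```

For text in a script without spaces the loop never calls remove(), which
matches the point that this step has nothing to do there.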

If the language has no spaces (Japanese, or Thai I
guess), then this particular portion, whitespace
removal, is not relevant, I would guess. I think
dictionaries are needed for hyphenation, not space
removal, no?

Glen