You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@harmony.apache.org by Vladimir Strigun <vs...@gmail.com> on 2007/04/09 10:14:10 UTC

[contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Hi all!

I'm happy to announce one more contribution to harmony on behalf of
Intel. Provided implementation of charset encoders/decoders is
intended to replace the ICU-based charsets encoding/decoding
operations. The code was developed in clean-room environment inside
Intel and I'd like you to play with it and include to current Harmony
tree.

The package could be found there:
HARMONY-3593

The algorithms for charsets encoding/decoding differs from that of
ICU, all charsets are generated from current Harmony or any other
implementation of Java and could be properly integrated into current
nio_char module. The archive contains source files for 6 charsets:
GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
implementation of CharsetProvider; generator for other Charsets and
native part. I've tested the package with more that 90 charsets, and
all benchmarks and tests passed with new bundle. Additionally I have
significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
for antlr and ~5-8x for xalan.

The main advantages of the package are the following:
  - Code for every charset is generated by CharsetGenerator, thus, if
some modification would be necessary we need just correct generator
and re-generate all sources.
  - We use 2 different encoders and decoders for java and direct
buffers. Since most applications use java heap buffers, unlike
existing implementation it doesn't produce lots of native calls to
perform encoding/decoding operations on the java buffers those
significantly improving performance. This is the main reason why we
have such a significant boost for Dacapo.
  - Charset tables for encoding/decoding are stored in appropriate classes.

Since the package contains implementation for 6 charsets only,
documentations how to generate and build additional charsets you could
find in README file from contributed package.

Please do not hesitate to contact me for more details.

Thanks,
Vladimir.

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> >
> > Hi all!
> >
> > I'm happy to announce one more contribution to harmony on behalf of
> > Intel. Provided implementation of charset encoders/decoders is
> > intended to replace the ICU-based charsets encoding/decoding
> > operations. The code was developed in clean-room environment inside
> > Intel and I'd like you to play with it and include to current Harmony
> > tree.
> >
> > The package could be found there:
> > HARMONY-3593
> >
> > The algorithms for charsets encoding/decoding differs from that of
> > ICU, all charsets are generated from current Harmony or any other
> > implementation of Java and could be properly integrated into current
> > nio_char module. The archive contains source files for 6 charsets:
> > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > implementation of CharsetProvider; generator for other Charsets and
> > native part. I've tested the package with more that 90 charsets, and
> > all benchmarks and tests passed with new bundle. Additionally I have
> > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > for antlr and ~5-8x for xalan.
> >
> > The main advantages of the package are the following:
> >   - Code for every charset is generated by CharsetGenerator, thus, if
> > some modification would be necessary we need just correct generator
> > and re-generate all sources.
> >   - We use 2 different encoders and decoders for java and direct
> > buffers. Since most applications use java heap buffers, unlike
> > existing implementation it doesn't produce lots of native calls to
> > perform encoding/decoding operations on the java buffers those
> > significantly improving performance. This is the main reason why we
> > have such a significant boost for Dacapo.
> >   - Charset tables for encoding/decoding are stored in appropriate
> > classes.
> >
> > Since the package contains implementation for 6 charsets only,
> > documentations how to generate and build additional charsets you could
> > find in README file from contributed package.
> >
> > Please do not hesitate to contact me for more details.
>
>
> Hi Vladimir,
>
> Does it include the source code of hycharset.dll?
>

Andrew,

Instruction how to built native and java code you could find in the
issue description.
I've attached hycharset.dll, nio_char.jar and nio_char-src.jar for
testing purpose only. You either could use attached files or generate
sources and build code yourself.

Vladimir.

> Thanks,
> > Vladimir.
> >
>
>
>
> --
> Best regards,
> Andrew Zhang
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Andrew Zhang <zh...@gmail.com>.
On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
>
> Hi all!
>
> I'm happy to announce one more contribution to harmony on behalf of
> Intel. Provided implementation of charset encoders/decoders is
> intended to replace the ICU-based charsets encoding/decoding
> operations. The code was developed in clean-room environment inside
> Intel and I'd like you to play with it and include to current Harmony
> tree.
>
> The package could be found there:
> HARMONY-3593
>
> The algorithms for charsets encoding/decoding differs from that of
> ICU, all charsets are generated from current Harmony or any other
> implementation of Java and could be properly integrated into current
> nio_char module. The archive contains source files for 6 charsets:
> GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> implementation of CharsetProvider; generator for other Charsets and
> native part. I've tested the package with more that 90 charsets, and
> all benchmarks and tests passed with new bundle. Additionally I have
> significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> for antlr and ~5-8x for xalan.
>
> The main advantages of the package are the following:
>   - Code for every charset is generated by CharsetGenerator, thus, if
> some modification would be necessary we need just correct generator
> and re-generate all sources.
>   - We use 2 different encoders and decoders for java and direct
> buffers. Since most applications use java heap buffers, unlike
> existing implementation it doesn't produce lots of native calls to
> perform encoding/decoding operations on the java buffers those
> significantly improving performance. This is the main reason why we
> have such a significant boost for Dacapo.
>   - Charset tables for encoding/decoding are stored in appropriate
> classes.
>
> Since the package contains implementation for 6 charsets only,
> documentations how to generate and build additional charsets you could
> find in README file from contributed package.
>
> Please do not hesitate to contact me for more details.


Hi Vladimir,

Does it include the source code of hycharset.dll?

Thanks,
> Vladimir.
>



-- 
Best regards,
Andrew Zhang

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/10/07, Yang Paulex <pa...@gmail.com> wrote:
> 2007/4/9, Vladimir Strigun <vs...@gmail.com>:
> >
> > On 4/9/07, Yang Paulex <pa...@gmail.com> wrote:
> > > 2007/4/9, Vladimir Strigun <vs...@gmail.com>:
> > > >
> > > > Hi all!
> > > >
> > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > Intel. Provided implementation of charset encoders/decoders is
> > > > intended to replace the ICU-based charsets encoding/decoding
> > > > operations. The code was developed in clean-room environment inside
> > > > Intel and I'd like you to play with it and include to current Harmony
> > > > tree.
> > > >
> > > > The package could be found there:
> > > > HARMONY-3593
> > > >
> > > > The algorithms for charsets encoding/decoding differs from that of
> > > > ICU, all charsets are generated from current Harmony or any other
> > > > implementation of Java and could be properly integrated into current
> > > > nio_char module. The archive contains source files for 6 charsets:
> > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > implementation of CharsetProvider; generator for other Charsets and
> > > > native part. I've tested the package with more that 90 charsets, and
> > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > for antlr and ~5-8x for xalan.
> > > >
> > > > The main advantages of the package are the following:
> > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > some modification would be necessary we need just correct generator
> > > > and re-generate all sources.
> > > >   - We use 2 different encoders and decoders for java and direct
> > > > buffers. Since most applications use java heap buffers, unlike
> > > > existing implementation it doesn't produce lots of native calls to
> > > > perform encoding/decoding operations on the java buffers those
> > > > significantly improving performance. This is the main reason why we
> > > > have such a significant boost for Dacapo.
> > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > classes.
> > > >
> > > > Since the package contains implementation for 6 charsets only,
> > > > documentations how to generate and build additional charsets you could
> > > > find in README file from contributed package.
> > > >
> > > > Please do not hesitate to contact me for more details.
> > > >
> > > > Thanks,
> > > > Vladimir.
> > > >
> > >
> > > Good work, Vladimir and team in Intel!
> > >
> > > I'm also interested in a pure Java charset conversion provider for
> > Harmony,
> > > because the frequent JNI invocation in ICU4JNI(current Harmony charset
> > > provider) may impair the performance when dealing with small chunk of
> > bytes.
> > > But I noticed that, in this contribution, US_ASCII, ISO_8859_1 and
> > GB18030
> > > are implemented in native C, just out of interest, any special reason
> > not to
> > > implemented in Java?
> >
> > As I wrote ealier, 2 branches of code generated for every
> > encoder/decoder: java and native one. Native branch used only for
> > processing native byte buffers. Native branch could be easily removed
> > by small modification of generators, but performance measurements
> > shows that it's better to use native decoders/encoders
> > in case of native buffers.
>
>
> So there may be two implementations(one native, one java) for  one
> charsets?

exactly

> Thanks.
> > Vladimir.
> >
> > > --
> > > Paulex Yang
> > > China Software Development laboratory
> > > IBM
> > >
> >
>
>
>
> --
> Paulex Yang
> China Software Development laboratory
> IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Yang Paulex <pa...@gmail.com>.
2007/4/9, Vladimir Strigun <vs...@gmail.com>:
>
> On 4/9/07, Yang Paulex <pa...@gmail.com> wrote:
> > 2007/4/9, Vladimir Strigun <vs...@gmail.com>:
> > >
> > > Hi all!
> > >
> > > I'm happy to announce one more contribution to harmony on behalf of
> > > Intel. Provided implementation of charset encoders/decoders is
> > > intended to replace the ICU-based charsets encoding/decoding
> > > operations. The code was developed in clean-room environment inside
> > > Intel and I'd like you to play with it and include to current Harmony
> > > tree.
> > >
> > > The package could be found there:
> > > HARMONY-3593
> > >
> > > The algorithms for charsets encoding/decoding differs from that of
> > > ICU, all charsets are generated from current Harmony or any other
> > > implementation of Java and could be properly integrated into current
> > > nio_char module. The archive contains source files for 6 charsets:
> > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > implementation of CharsetProvider; generator for other Charsets and
> > > native part. I've tested the package with more that 90 charsets, and
> > > all benchmarks and tests passed with new bundle. Additionally I have
> > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > for antlr and ~5-8x for xalan.
> > >
> > > The main advantages of the package are the following:
> > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > some modification would be necessary we need just correct generator
> > > and re-generate all sources.
> > >   - We use 2 different encoders and decoders for java and direct
> > > buffers. Since most applications use java heap buffers, unlike
> > > existing implementation it doesn't produce lots of native calls to
> > > perform encoding/decoding operations on the java buffers those
> > > significantly improving performance. This is the main reason why we
> > > have such a significant boost for Dacapo.
> > >   - Charset tables for encoding/decoding are stored in appropriate
> > > classes.
> > >
> > > Since the package contains implementation for 6 charsets only,
> > > documentations how to generate and build additional charsets you could
> > > find in README file from contributed package.
> > >
> > > Please do not hesitate to contact me for more details.
> > >
> > > Thanks,
> > > Vladimir.
> > >
> >
> > Good work, Vladimir and team in Intel!
> >
> > I'm also interested in a pure Java charset conversion provider for
> Harmony,
> > because the frequent JNI invocation in ICU4JNI(current Harmony charset
> > provider) may impair the performance when dealing with small chunk of
> bytes.
> > But I noticed that, in this contribution, US_ASCII, ISO_8859_1 and
> GB18030
> > are implemented in native C, just out of interest, any special reason
> not to
> > implemented in Java?
>
> As I wrote ealier, 2 branches of code generated for every
> encoder/decoder: java and native one. Native branch used only for
> processing native byte buffers. Native branch could be easily removed
> by small modification of generators, but performance measurements
> shows that it's better to use native decoders/encoders
> in case of native buffers.


So there may be two implementations(one native, one java) for  one
charsets?

Thanks.
> Vladimir.
>
> > --
> > Paulex Yang
> > China Software Development laboratory
> > IBM
> >
>



-- 
Paulex Yang
China Software Development laboratory
IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/9/07, Yang Paulex <pa...@gmail.com> wrote:
> 2007/4/9, Vladimir Strigun <vs...@gmail.com>:
> >
> > Hi all!
> >
> > I'm happy to announce one more contribution to harmony on behalf of
> > Intel. Provided implementation of charset encoders/decoders is
> > intended to replace the ICU-based charsets encoding/decoding
> > operations. The code was developed in clean-room environment inside
> > Intel and I'd like you to play with it and include to current Harmony
> > tree.
> >
> > The package could be found there:
> > HARMONY-3593
> >
> > The algorithms for charsets encoding/decoding differs from that of
> > ICU, all charsets are generated from current Harmony or any other
> > implementation of Java and could be properly integrated into current
> > nio_char module. The archive contains source files for 6 charsets:
> > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > implementation of CharsetProvider; generator for other Charsets and
> > native part. I've tested the package with more that 90 charsets, and
> > all benchmarks and tests passed with new bundle. Additionally I have
> > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > for antlr and ~5-8x for xalan.
> >
> > The main advantages of the package are the following:
> >   - Code for every charset is generated by CharsetGenerator, thus, if
> > some modification would be necessary we need just correct generator
> > and re-generate all sources.
> >   - We use 2 different encoders and decoders for java and direct
> > buffers. Since most applications use java heap buffers, unlike
> > existing implementation it doesn't produce lots of native calls to
> > perform encoding/decoding operations on the java buffers those
> > significantly improving performance. This is the main reason why we
> > have such a significant boost for Dacapo.
> >   - Charset tables for encoding/decoding are stored in appropriate
> > classes.
> >
> > Since the package contains implementation for 6 charsets only,
> > documentations how to generate and build additional charsets you could
> > find in README file from contributed package.
> >
> > Please do not hesitate to contact me for more details.
> >
> > Thanks,
> > Vladimir.
> >
>
> Good work, Vladimir and team in Intel!
>
> I'm also interested in a pure Java charset conversion provider for Harmony,
> because the frequent JNI invocation in ICU4JNI(current Harmony charset
> provider) may impair the performance when dealing with small chunk of bytes.
> But I noticed that, in this contribution, US_ASCII, ISO_8859_1 and GB18030
> are implemented in native C, just out of interest, any special reason not to
> implemented in Java?

As I wrote ealier, 2 branches of code generated for every
encoder/decoder: java and native one. Native branch used only for
processing native byte buffers. Native branch could be easily removed
by small modification of generators, but performance measurements
shows that it's better to use native decoders/encoders
in case of native buffers.

Thanks.
Vladimir.

> --
> Paulex Yang
> China Software Development laboratory
> IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Yang Paulex <pa...@gmail.com>.
2007/4/9, Vladimir Strigun <vs...@gmail.com>:
>
> Hi all!
>
> I'm happy to announce one more contribution to harmony on behalf of
> Intel. Provided implementation of charset encoders/decoders is
> intended to replace the ICU-based charsets encoding/decoding
> operations. The code was developed in clean-room environment inside
> Intel and I'd like you to play with it and include to current Harmony
> tree.
>
> The package could be found there:
> HARMONY-3593
>
> The algorithms for charsets encoding/decoding differs from that of
> ICU, all charsets are generated from current Harmony or any other
> implementation of Java and could be properly integrated into current
> nio_char module. The archive contains source files for 6 charsets:
> GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> implementation of CharsetProvider; generator for other Charsets and
> native part. I've tested the package with more that 90 charsets, and
> all benchmarks and tests passed with new bundle. Additionally I have
> significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> for antlr and ~5-8x for xalan.
>
> The main advantages of the package are the following:
>   - Code for every charset is generated by CharsetGenerator, thus, if
> some modification would be necessary we need just correct generator
> and re-generate all sources.
>   - We use 2 different encoders and decoders for java and direct
> buffers. Since most applications use java heap buffers, unlike
> existing implementation it doesn't produce lots of native calls to
> perform encoding/decoding operations on the java buffers those
> significantly improving performance. This is the main reason why we
> have such a significant boost for Dacapo.
>   - Charset tables for encoding/decoding are stored in appropriate
> classes.
>
> Since the package contains implementation for 6 charsets only,
> documentations how to generate and build additional charsets you could
> find in README file from contributed package.
>
> Please do not hesitate to contact me for more details.
>
> Thanks,
> Vladimir.
>

Good work, Vladimir and team in Intel!

I'm also interested in a pure Java charset conversion provider for Harmony,
because the frequent JNI invocation in ICU4JNI(current Harmony charset
provider) may impair the performance when dealing with small chunk of bytes.
But I noticed that, in this contribution, US_ASCII, ISO_8859_1 and GB18030
are implemented in native C, just out of interest, any special reason not to
implemented in Java?

-- 
Paulex Yang
China Software Development laboratory
IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/9/07, Volynets, Vera <ve...@intel.com> wrote:
> Hi,
> there is one small point about our independence from icu.
> Vm uses icu4c during classfile parsing. It would be great if we have the
> same functionality and don't use icu4c.
> Do you work on it?

Vera,

With the new bundle we still dependent on ICU. No, I'm not working on
replacement for icu4c.

Thanks.
Vladimir.



> WBR,Vera!
>
> -----Original Message-----
> From: Tony Wu [mailto:wuyuehao@gmail.com]
> Sent: Monday, April 09, 2007 1:50 PM
> To: dev@harmony.apache.org
> Subject: Re: [contribution] Contribution of charset encoders/decoders
> for NIO_CHAR module
>
> I wonder if it is possible to make it as built-in charset provider and
> make icu as an extension?
>
> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > amazing work.
> > generating the charsets...
> >
> > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > >
> > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > Super cool!!!
> > > > > > Does it mean we're not dependent on ICU any more?
> > > > >
> > > > > Unfortunately not all charsets supported with attached bundle.
> The
> > > > > list of supported charsets you could find in README file.
> > > >
> > > >
> > > > Hi Vladimir, not unfortunately at all. :)
> > > >
> > > > We're on the way to be independent of ICU, right? ;)
> > >
> > > Yes, you right,  we're on the way :)
> > >
> > >
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > >
> > > > > > > Hi all!
> > > > > > >
> > > > > > > I'm happy to announce one more contribution to harmony on
> behalf of
> > > > > > > Intel. Provided implementation of charset encoders/decoders
> is
> > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > operations. The code was developed in clean-room environment
> inside
> > > > > > > Intel and I'd like you to play with it and include to
> current Harmony
> > > > > > > tree.
> > > > > > >
> > > > > > > The package could be found there:
> > > > > > > HARMONY-3593
> > > > > > >
> > > > > > > The algorithms for charsets encoding/decoding differs from
> that of
> > > > > > > ICU, all charsets are generated from current Harmony or any
> other
> > > > > > > implementation of Java and could be properly integrated into
> current
> > > > > > > nio_char module. The archive contains source files for 6
> charsets:
> > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE,
> UTF-16LE;
> > > > > > > implementation of CharsetProvider; generator for other
> Charsets and
> > > > > > > native part. I've tested the package with more that 90
> charsets, and
> > > > > > > all benchmarks and tests passed with new bundle.
> Additionally I have
> > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan
> benchmarks with
> > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have
> 2.5x boost
> > > > > > > for antlr and ~5-8x for xalan.
> > > > > > >
> > > > > > > The main advantages of the package are the following:
> > > > > > >   - Code for every charset is generated by CharsetGenerator,
> thus, if
> > > > > > > some modification would be necessary we need just correct
> generator
> > > > > > > and re-generate all sources.
> > > > > > >   - We use 2 different encoders and decoders for java and
> direct
> > > > > > > buffers. Since most applications use java heap buffers,
> unlike
> > > > > > > existing implementation it doesn't produce lots of native
> calls to
> > > > > > > perform encoding/decoding operations on the java buffers
> those
> > > > > > > significantly improving performance. This is the main reason
> why we
> > > > > > > have such a significant boost for Dacapo.
> > > > > > >   - Charset tables for encoding/decoding are stored in
> appropriate
> > > > > > > classes.
> > > > > > >
> > > > > > > Since the package contains implementation for 6 charsets
> only,
> > > > > > > documentations how to generate and build additional charsets
> you could
> > > > > > > find in README file from contributed package.
> > > > > > >
> > > > > > > Please do not hesitate to contact me for more details.
> > > > > > >
> > > > > > > Thanks,
> > > > > > > Vladimir.
> > > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Best regards,
> > > > > > Andrew Zhang
> > > > > >
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > Best regards,
> > > > Andrew Zhang
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Richard Liang <ri...@gmail.com>.
On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> I wonder if it is possible to make it as built-in charset provider and
> make icu as an extension?

Good question? ;-)

>
> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > amazing work.
> > generating the charsets...
> >
> > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > >
> > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > Super cool!!!
> > > > > > Does it mean we're not dependent on ICU any more?
> > > > >
> > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > list of supported charsets you could find in README file.
> > > >
> > > >
> > > > Hi Vladimir, not unfortunately at all. :)
> > > >
> > > > We're on the way to be independent of ICU, right? ;)
> > >
> > > Yes, you right,  we're on the way :)
> > >
> > >
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > >
> > > > > > > Hi all!
> > > > > > >
> > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > tree.
> > > > > > >
> > > > > > > The package could be found there:
> > > > > > > HARMONY-3593
> > > > > > >
> > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > for antlr and ~5-8x for xalan.
> > > > > > >
> > > > > > > The main advantages of the package are the following:
> > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > some modification would be necessary we need just correct generator
> > > > > > > and re-generate all sources.
> > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > have such a significant boost for Dacapo.
> > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > classes.
> > > > > > >
> > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > find in README file from contributed package.
> > > > > > >
> > > > > > > Please do not hesitate to contact me for more details.
> > > > > > >
> > > > > > > Thanks,
> > > > > > > Vladimir.
> > > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Best regards,
> > > > > > Andrew Zhang
> > > > > >
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > Best regards,
> > > > Andrew Zhang
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Richard Liang

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
What version of VM you used?
I used Harmony-vme-win.IA32-v4.zip

On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> Hi Vladimir,
> I did exactly as you said. I think probably because we uses different vm?
>
> On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > Tony,
> >
> > Am I correctly understood that you used binary version?
> > I used the next steps:
> > 1. download harmony classlib/trunk (rev 527065)
> > 2. ant fetch-depends
> > 3. ant
> > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > 5. copy hycharset.dll attached to JIRA to jre/bin
> > 6. java Hello
> > Hello
> >
> > Could you please describe the steps you used to get the error?
> >
> > Thanks.
> > Vladimir.
> >
> > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > Hi,
> > >
> > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > unfortunately I got following error :(
> > >
> > > did I miss something?
> > >
> > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > >        at java/lang/System.<clinit> (System.java:110)
> > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > >        at java/lang/Thread.initialize (Thread.java:339)
> > >        at java/lang/Thread.<init> (Thread.java:126)
> > > java/lang/NullPointerException
> > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > >        at java/lang/System.<clinit> (System.java:110)
> > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > >        at java/lang/Thread.initialize (Thread.java:339)
> > >        at java/lang/Thread.<init> (Thread.java:126)
> > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > JVMJ9VM009E J9VMDllMain failed
> > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > FAILED to invoke JVM.
> > >
> > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > Hi Tony,
> > > >
> > > > header files could be generated by javac tool :)
> > > > let me know if you will have additional problems.
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > Hi Vladimir,
> > > > >
> > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > your patch, still I can not build from src.
> > > > >
> > > > > trying the binary..
> > > > >
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > And I have not found a makefile for native?
> > > > > >
> > > > > > Tony,
> > > > > >
> > > > > > I've attached makefiles to the JIRA issue.
> > > > > >
> > > > > > Thanks.
> > > > > > Vladimir.
> > > > > >
> > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > Hi Vladimir,
> > > > > > > >
> > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > >
> > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > >
> > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > make icu as an extension?
> > > > > > > > >
> > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > implemented charsets used from ICU.
> > > > > > > > >
> > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-mac-cyrillic
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Thanks.
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > amazing work.
> > > > > > > > > > > generating the charsets...
> > > > > > > > > > >
> > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > >
> > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > >
> > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Vladimir
good job! It imporves the performace explicitly, especially in ISO8859-1.
I've attached my latest result compared with the previous one. It
shows that we got 3x faster in ISO-8859-1 and 2x faster in UTF-8 with
large input. Another interesting result is that it takes a little more
time when decoding a small input in UTF-8 ;-)

On 5/8/07, Vladimir Strigun <vs...@gmail.com> wrote:
> Hi Tony,
>
> I've slightly updated charset bundle:
> - all charset definitions were moved to CharsetProviderImpl class
> - additional branch for heap buffers were added to java part of
> encoders/decoders
> - charset generator for one-byte-per-char charset was updated accordingly
>
> New version could be found in Harmony-3593. Results of performance
> tests execution I attached to JIRA issue as well. Updated performance
> tests yopu could find in Harmony-3709 issue.
>
> Within the new branch for heap buffer processing I have 2x speedup
> without degradation on Dacapo benchmark.
>
> Could you please try the new version of the charsets?
>
> Thanks.
> Vladimir.
>
> On 4/20/07, Tony Wu <wu...@gmail.com> wrote:
> > Hi Vladimir,
> > I put the testcase on 3709 with an ant script for convenience.
> >
> > On 4/19/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > Tony,
> > >
> > > Thanks for the performance analysis. This info is for J9 VM, right?
> > > Could you please attach your tests to JIRA?
> > > I'd like to try it with DRLVM.
> > >
> > > Thanks.
> > > Vladimir.
> > >
> > > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > > Here is the report for large input data
> > > >
> > > > large input data:
> > > > ----------------------
> > > > 251k for GB18030
> > > > 133k for UTF-8 and 8859-1
> > > >
> > > > encoding 1,000 times
> > > > ================
> > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > UTF-8   563.0
> > > > GB18030 5187.0
> > > > 8859-1  219.0
> > > >
> > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > UTF-8   2016.0
> > > > GB18030 3828.0
> > > > 8859-1  1703.0
> > > >
> > > > decoding 1,000 times
> > > > ================
> > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > UTF-8   375.0
> > > > GB18030 2578.0
> > > > 8859-1  156.0
> > > >
> > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > UTF-8   2610.0
> > > > GB18030 3781.0
> > > > 8859-1  1297.0
> > > >
> > > >
> > > >
> > > > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > sorry I sent previous mail by mistake :(
> > > > >
> > > > > The result is,
> > > > >
> > > > > small input data:
> > > > > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > > > > "abcdEfG" for UTF-8 and 8859-1
> > > > > encode 1,000,000 times
> > > > > =================
> > > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > > UTF-8   8047.0
> > > > > GB18030 5203.0
> > > > > 8859-1  3468.0
> > > > >
> > > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > > UTF-8   1610.0
> > > > > GB18030 4328.0
> > > > > 8859-1  1282.0
> > > > >
> > > > > decoding 1,000,000 times
> > > > > ==================
> > > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > > UTF-8   5453.0
> > > > > GB18030 4703.0
> > > > > 8859-1  3407.0
> > > > >
> > > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > > UTF-8   1860.0
> > > > > GB18030 2671.0
> > > > > 8859-1  2234.0
> > > > >
> > > > > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > Hi Vladimir,
> > > > > >
> > > > > > I did some tests on my thinkpad with following env. The reslut is
> > > > > > milliseconds, the less the better.
> > > > > >
> > > > > > win xp sp2
> > > > > > 2G RAM
> > > > > > Intel Pentium 1.86GHZ
> > > > > >
> > > > > > small input data:
> > > > > > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > > > > > "abcdEfG" for UTF-8 and 8859-1
> > > > > > encode 1,000,000 times
> > > > > >
> > > > > > icu
> > > > > > intel
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > large input data:
> > > > > > 251k for GB18030
> > > > > > 133k for UTF-8 and 8859-1
> > > > > >
> > > > > >
> > > > > >
> > > > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > > >
> > > > > >
> > > > > >
> > > > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > > >
> > > > > >
> > > > > > On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > Vladimir,
> > > > > > > I'm writing some testcases for the performance of encoding and
> > > > > > > decoding, I'll show you next week.
> > > > > > >
> > > > > > > On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > Tony,
> > > > > > > >
> > > > > > > > The reason of failure on you machine was absence of default charset
> > > > > > > > for you locale in CharsetProviderImplStd. Updated version of binary
> > > > > > > > file include explicit check of charset availability in additional
> > > > > > > > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > > > > > > > failure after locale change on my machine.
> > > > > > > >
> > > > > > > > I will merge all new charsets to one provider and update contribution
> > > > > > > > package soon.
> > > > > > > >
> > > > > > > > By the way, have you run any performance measurements with new charset
> > > > > > > > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > > > > > > > for antlr and xalan Dacapo benchmarks.
> > > > > > > >
> > > > > > > > ICU impl:
> > > > > > > > antlr: 2684 msec
> > > > > > > > xalan: 2372 msec
> > > > > > > >
> > > > > > > > new charsets:
> > > > > > > > antlr: 1854 msec
> > > > > > > > xalan: 1846 msec
> > > > > > > >
> > > > > > > > Thanks.
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > And I 'm interested in the difference between the 2 binary versions.
> > > > > > > > > Actually I encounter similar problem in my working.
> > > > > > > > >
> > > > > > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > The new binary version works, Thanks Vladimir.
> > > > > > > > > >
> > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > Tony,
> > > > > > > > > > >
> > > > > > > > > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > > > > > > > > you please try it?
> > > > > > > > > > >
> > > > > > > > > > > Thanks.
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > Tony,
> > > > > > > > > > > > >
> > > > > > > > > > > > > Am I correctly understood that you used binary version?
> > > > > > > > > > > > > I used the next steps:
> > > > > > > > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > > > > > > > 2. ant fetch-depends
> > > > > > > > > > > > > 3. ant
> > > > > > > > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > > > > > > > 6. java Hello
> > > > > > > > > > > > > Hello
> > > > > > > > > > > > >
> > > > > > > > > > > > > Could you please describe the steps you used to get the error?
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > >
> > > > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > Hi,
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > > > > > > > > unfortunately I got following error :(
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > did I miss something?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > > > > java/lang/NullPointerException
> > > > > > > > > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > > > > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > > > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > > > > > > > FAILED to invoke JVM.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > Hi Tony,
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > > > > > > > let me know if you will have additional problems.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > trying the binary..
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Tony,
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
Hi Tony,

I've slightly updated charset bundle:
- all charset definitions were moved to CharsetProviderImpl class
- additional branch for heap buffers were added to java part of
encoders/decoders
- charset generator for one-byte-per-char charset was updated accordingly

New version could be found in Harmony-3593. Results of performance
tests execution I attached to JIRA issue as well. Updated performance
tests yopu could find in Harmony-3709 issue.

Within the new branch for heap buffer processing I have 2x speedup
without degradation on Dacapo benchmark.

Could you please try the new version of the charsets?

Thanks.
Vladimir.

On 4/20/07, Tony Wu <wu...@gmail.com> wrote:
> Hi Vladimir,
> I put the testcase on 3709 with an ant script for convenience.
>
> On 4/19/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > Tony,
> >
> > Thanks for the performance analysis. This info is for J9 VM, right?
> > Could you please attach your tests to JIRA?
> > I'd like to try it with DRLVM.
> >
> > Thanks.
> > Vladimir.
> >
> > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > Here is the report for large input data
> > >
> > > large input data:
> > > ----------------------
> > > 251k for GB18030
> > > 133k for UTF-8 and 8859-1
> > >
> > > encoding 1,000 times
> > > ================
> > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > UTF-8   563.0
> > > GB18030 5187.0
> > > 8859-1  219.0
> > >
> > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > UTF-8   2016.0
> > > GB18030 3828.0
> > > 8859-1  1703.0
> > >
> > > decoding 1,000 times
> > > ================
> > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > UTF-8   375.0
> > > GB18030 2578.0
> > > 8859-1  156.0
> > >
> > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > UTF-8   2610.0
> > > GB18030 3781.0
> > > 8859-1  1297.0
> > >
> > >
> > >
> > > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > > sorry I sent previous mail by mistake :(
> > > >
> > > > The result is,
> > > >
> > > > small input data:
> > > > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > > > "abcdEfG" for UTF-8 and 8859-1
> > > > encode 1,000,000 times
> > > > =================
> > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > UTF-8   8047.0
> > > > GB18030 5203.0
> > > > 8859-1  3468.0
> > > >
> > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > UTF-8   1610.0
> > > > GB18030 4328.0
> > > > 8859-1  1282.0
> > > >
> > > > decoding 1,000,000 times
> > > > ==================
> > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > UTF-8   5453.0
> > > > GB18030 4703.0
> > > > 8859-1  3407.0
> > > >
> > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > UTF-8   1860.0
> > > > GB18030 2671.0
> > > > 8859-1  2234.0
> > > >
> > > > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > Hi Vladimir,
> > > > >
> > > > > I did some tests on my thinkpad with following env. The reslut is
> > > > > milliseconds, the less the better.
> > > > >
> > > > > win xp sp2
> > > > > 2G RAM
> > > > > Intel Pentium 1.86GHZ
> > > > >
> > > > > small input data:
> > > > > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > > > > "abcdEfG" for UTF-8 and 8859-1
> > > > > encode 1,000,000 times
> > > > >
> > > > > icu
> > > > > intel
> > > > >
> > > > >
> > > > >
> > > > >
> > > > > large input data:
> > > > > 251k for GB18030
> > > > > 133k for UTF-8 and 8859-1
> > > > >
> > > > >
> > > > >
> > > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > > >
> > > > >
> > > > >
> > > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > > >
> > > > >
> > > > > On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > Vladimir,
> > > > > > I'm writing some testcases for the performance of encoding and
> > > > > > decoding, I'll show you next week.
> > > > > >
> > > > > > On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > Tony,
> > > > > > >
> > > > > > > The reason of failure on you machine was absence of default charset
> > > > > > > for you locale in CharsetProviderImplStd. Updated version of binary
> > > > > > > file include explicit check of charset availability in additional
> > > > > > > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > > > > > > failure after locale change on my machine.
> > > > > > >
> > > > > > > I will merge all new charsets to one provider and update contribution
> > > > > > > package soon.
> > > > > > >
> > > > > > > By the way, have you run any performance measurements with new charset
> > > > > > > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > > > > > > for antlr and xalan Dacapo benchmarks.
> > > > > > >
> > > > > > > ICU impl:
> > > > > > > antlr: 2684 msec
> > > > > > > xalan: 2372 msec
> > > > > > >
> > > > > > > new charsets:
> > > > > > > antlr: 1854 msec
> > > > > > > xalan: 1846 msec
> > > > > > >
> > > > > > > Thanks.
> > > > > > > Vladimir.
> > > > > > >
> > > > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > And I 'm interested in the difference between the 2 binary versions.
> > > > > > > > Actually I encounter similar problem in my working.
> > > > > > > >
> > > > > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > The new binary version works, Thanks Vladimir.
> > > > > > > > >
> > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > Tony,
> > > > > > > > > >
> > > > > > > > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > > > > > > > you please try it?
> > > > > > > > > >
> > > > > > > > > > Thanks.
> > > > > > > > > > Vladimir.
> > > > > > > > > >
> > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > > > > > > > >
> > > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > Tony,
> > > > > > > > > > > >
> > > > > > > > > > > > Am I correctly understood that you used binary version?
> > > > > > > > > > > > I used the next steps:
> > > > > > > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > > > > > > 2. ant fetch-depends
> > > > > > > > > > > > 3. ant
> > > > > > > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > > > > > > 6. java Hello
> > > > > > > > > > > > Hello
> > > > > > > > > > > >
> > > > > > > > > > > > Could you please describe the steps you used to get the error?
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks.
> > > > > > > > > > > > Vladimir.
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > Hi,
> > > > > > > > > > > > >
> > > > > > > > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > > > > > > > unfortunately I got following error :(
> > > > > > > > > > > > >
> > > > > > > > > > > > > did I miss something?
> > > > > > > > > > > > >
> > > > > > > > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > > > java/lang/NullPointerException
> > > > > > > > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > > > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > > > > > > FAILED to invoke JVM.
> > > > > > > > > > > > >
> > > > > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > Hi Tony,
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > > > > > > let me know if you will have additional problems.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > trying the binary..
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Tony,
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Hi Vladimir,
I put the testcase on 3709 with an ant script for convenience.

On 4/19/07, Vladimir Strigun <vs...@gmail.com> wrote:
> Tony,
>
> Thanks for the performance analysis. This info is for J9 VM, right?
> Could you please attach your tests to JIRA?
> I'd like to try it with DRLVM.
>
> Thanks.
> Vladimir.
>
> On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > Here is the report for large input data
> >
> > large input data:
> > ----------------------
> > 251k for GB18030
> > 133k for UTF-8 and 8859-1
> >
> > encoding 1,000 times
> > ================
> > com.ibm.icu4jni.charset.CharsetProiverICU
> > UTF-8   563.0
> > GB18030 5187.0
> > 8859-1  219.0
> >
> > org.apache.harmony.niochar.CharsetProviderimplStd
> > UTF-8   2016.0
> > GB18030 3828.0
> > 8859-1  1703.0
> >
> > decoding 1,000 times
> > ================
> > com.ibm.icu4jni.charset.CharsetProiverICU
> > UTF-8   375.0
> > GB18030 2578.0
> > 8859-1  156.0
> >
> > org.apache.harmony.niochar.CharsetProviderimplStd
> > UTF-8   2610.0
> > GB18030 3781.0
> > 8859-1  1297.0
> >
> >
> >
> > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > sorry I sent previous mail by mistake :(
> > >
> > > The result is,
> > >
> > > small input data:
> > > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > > "abcdEfG" for UTF-8 and 8859-1
> > > encode 1,000,000 times
> > > =================
> > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > UTF-8   8047.0
> > > GB18030 5203.0
> > > 8859-1  3468.0
> > >
> > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > UTF-8   1610.0
> > > GB18030 4328.0
> > > 8859-1  1282.0
> > >
> > > decoding 1,000,000 times
> > > ==================
> > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > UTF-8   5453.0
> > > GB18030 4703.0
> > > 8859-1  3407.0
> > >
> > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > UTF-8   1860.0
> > > GB18030 2671.0
> > > 8859-1  2234.0
> > >
> > > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > > Hi Vladimir,
> > > >
> > > > I did some tests on my thinkpad with following env. The reslut is
> > > > milliseconds, the less the better.
> > > >
> > > > win xp sp2
> > > > 2G RAM
> > > > Intel Pentium 1.86GHZ
> > > >
> > > > small input data:
> > > > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > > > "abcdEfG" for UTF-8 and 8859-1
> > > > encode 1,000,000 times
> > > >
> > > > icu
> > > > intel
> > > >
> > > >
> > > >
> > > >
> > > > large input data:
> > > > 251k for GB18030
> > > > 133k for UTF-8 and 8859-1
> > > >
> > > >
> > > >
> > > > com.ibm.icu4jni.charset.CharsetProiverICU
> > > >
> > > >
> > > >
> > > > org.apache.harmony.niochar.CharsetProviderimplStd
> > > >
> > > >
> > > > On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > Vladimir,
> > > > > I'm writing some testcases for the performance of encoding and
> > > > > decoding, I'll show you next week.
> > > > >
> > > > > On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > Tony,
> > > > > >
> > > > > > The reason of failure on you machine was absence of default charset
> > > > > > for you locale in CharsetProviderImplStd. Updated version of binary
> > > > > > file include explicit check of charset availability in additional
> > > > > > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > > > > > failure after locale change on my machine.
> > > > > >
> > > > > > I will merge all new charsets to one provider and update contribution
> > > > > > package soon.
> > > > > >
> > > > > > By the way, have you run any performance measurements with new charset
> > > > > > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > > > > > for antlr and xalan Dacapo benchmarks.
> > > > > >
> > > > > > ICU impl:
> > > > > > antlr: 2684 msec
> > > > > > xalan: 2372 msec
> > > > > >
> > > > > > new charsets:
> > > > > > antlr: 1854 msec
> > > > > > xalan: 1846 msec
> > > > > >
> > > > > > Thanks.
> > > > > > Vladimir.
> > > > > >
> > > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > And I 'm interested in the difference between the 2 binary versions.
> > > > > > > Actually I encounter similar problem in my working.
> > > > > > >
> > > > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > The new binary version works, Thanks Vladimir.
> > > > > > > >
> > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > Tony,
> > > > > > > > >
> > > > > > > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > > > > > > you please try it?
> > > > > > > > >
> > > > > > > > > Thanks.
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > Hi Vladimir,
> > > > > > > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > > > > > > >
> > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > Tony,
> > > > > > > > > > >
> > > > > > > > > > > Am I correctly understood that you used binary version?
> > > > > > > > > > > I used the next steps:
> > > > > > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > > > > > 2. ant fetch-depends
> > > > > > > > > > > 3. ant
> > > > > > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > > > > > 6. java Hello
> > > > > > > > > > > Hello
> > > > > > > > > > >
> > > > > > > > > > > Could you please describe the steps you used to get the error?
> > > > > > > > > > >
> > > > > > > > > > > Thanks.
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > Hi,
> > > > > > > > > > > >
> > > > > > > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > > > > > > unfortunately I got following error :(
> > > > > > > > > > > >
> > > > > > > > > > > > did I miss something?
> > > > > > > > > > > >
> > > > > > > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > > java/lang/NullPointerException
> > > > > > > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > > > > > FAILED to invoke JVM.
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > Hi Tony,
> > > > > > > > > > > > >
> > > > > > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > > > > > let me know if you will have additional problems.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > >
> > > > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > trying the binary..
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Tony,
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
Tony,

Thanks for the performance analysis. This info is for J9 VM, right?
Could you please attach your tests to JIRA?
I'd like to try it with DRLVM.

Thanks.
Vladimir.

On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> Here is the report for large input data
>
> large input data:
> ----------------------
> 251k for GB18030
> 133k for UTF-8 and 8859-1
>
> encoding 1,000 times
> ================
> com.ibm.icu4jni.charset.CharsetProiverICU
> UTF-8   563.0
> GB18030 5187.0
> 8859-1  219.0
>
> org.apache.harmony.niochar.CharsetProviderimplStd
> UTF-8   2016.0
> GB18030 3828.0
> 8859-1  1703.0
>
> decoding 1,000 times
> ================
> com.ibm.icu4jni.charset.CharsetProiverICU
> UTF-8   375.0
> GB18030 2578.0
> 8859-1  156.0
>
> org.apache.harmony.niochar.CharsetProviderimplStd
> UTF-8   2610.0
> GB18030 3781.0
> 8859-1  1297.0
>
>
>
> On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > sorry I sent previous mail by mistake :(
> >
> > The result is,
> >
> > small input data:
> > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > "abcdEfG" for UTF-8 and 8859-1
> > encode 1,000,000 times
> > =================
> > com.ibm.icu4jni.charset.CharsetProiverICU
> > UTF-8   8047.0
> > GB18030 5203.0
> > 8859-1  3468.0
> >
> > org.apache.harmony.niochar.CharsetProviderimplStd
> > UTF-8   1610.0
> > GB18030 4328.0
> > 8859-1  1282.0
> >
> > decoding 1,000,000 times
> > ==================
> > com.ibm.icu4jni.charset.CharsetProiverICU
> > UTF-8   5453.0
> > GB18030 4703.0
> > 8859-1  3407.0
> >
> > org.apache.harmony.niochar.CharsetProviderimplStd
> > UTF-8   1860.0
> > GB18030 2671.0
> > 8859-1  2234.0
> >
> > On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > > Hi Vladimir,
> > >
> > > I did some tests on my thinkpad with following env. The reslut is
> > > milliseconds, the less the better.
> > >
> > > win xp sp2
> > > 2G RAM
> > > Intel Pentium 1.86GHZ
> > >
> > > small input data:
> > > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > > "abcdEfG" for UTF-8 and 8859-1
> > > encode 1,000,000 times
> > >
> > > icu
> > > intel
> > >
> > >
> > >
> > >
> > > large input data:
> > > 251k for GB18030
> > > 133k for UTF-8 and 8859-1
> > >
> > >
> > >
> > > com.ibm.icu4jni.charset.CharsetProiverICU
> > >
> > >
> > >
> > > org.apache.harmony.niochar.CharsetProviderimplStd
> > >
> > >
> > > On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> > > > Vladimir,
> > > > I'm writing some testcases for the performance of encoding and
> > > > decoding, I'll show you next week.
> > > >
> > > > On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > Tony,
> > > > >
> > > > > The reason of failure on you machine was absence of default charset
> > > > > for you locale in CharsetProviderImplStd. Updated version of binary
> > > > > file include explicit check of charset availability in additional
> > > > > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > > > > failure after locale change on my machine.
> > > > >
> > > > > I will merge all new charsets to one provider and update contribution
> > > > > package soon.
> > > > >
> > > > > By the way, have you run any performance measurements with new charset
> > > > > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > > > > for antlr and xalan Dacapo benchmarks.
> > > > >
> > > > > ICU impl:
> > > > > antlr: 2684 msec
> > > > > xalan: 2372 msec
> > > > >
> > > > > new charsets:
> > > > > antlr: 1854 msec
> > > > > xalan: 1846 msec
> > > > >
> > > > > Thanks.
> > > > > Vladimir.
> > > > >
> > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > And I 'm interested in the difference between the 2 binary versions.
> > > > > > Actually I encounter similar problem in my working.
> > > > > >
> > > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > The new binary version works, Thanks Vladimir.
> > > > > > >
> > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > Tony,
> > > > > > > >
> > > > > > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > > > > > you please try it?
> > > > > > > >
> > > > > > > > Thanks.
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > Hi Vladimir,
> > > > > > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > > > > > >
> > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > Tony,
> > > > > > > > > >
> > > > > > > > > > Am I correctly understood that you used binary version?
> > > > > > > > > > I used the next steps:
> > > > > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > > > > 2. ant fetch-depends
> > > > > > > > > > 3. ant
> > > > > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > > > > 6. java Hello
> > > > > > > > > > Hello
> > > > > > > > > >
> > > > > > > > > > Could you please describe the steps you used to get the error?
> > > > > > > > > >
> > > > > > > > > > Thanks.
> > > > > > > > > > Vladimir.
> > > > > > > > > >
> > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > Hi,
> > > > > > > > > > >
> > > > > > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > > > > > unfortunately I got following error :(
> > > > > > > > > > >
> > > > > > > > > > > did I miss something?
> > > > > > > > > > >
> > > > > > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > java/lang/NullPointerException
> > > > > > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > > > > FAILED to invoke JVM.
> > > > > > > > > > >
> > > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > Hi Tony,
> > > > > > > > > > > >
> > > > > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > > > > let me know if you will have additional problems.
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks.
> > > > > > > > > > > > Vladimir.
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > >
> > > > > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > > > > >
> > > > > > > > > > > > > trying the binary..
> > > > > > > > > > > > >
> > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Tony,
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Here is the report for large input data

large input data:
----------------------
251k for GB18030
133k for UTF-8 and 8859-1

encoding 1,000 times
================
com.ibm.icu4jni.charset.CharsetProiverICU
UTF-8   563.0
GB18030 5187.0
8859-1  219.0

org.apache.harmony.niochar.CharsetProviderimplStd
UTF-8   2016.0
GB18030 3828.0
8859-1  1703.0

decoding 1,000 times
================
com.ibm.icu4jni.charset.CharsetProiverICU
UTF-8   375.0
GB18030 2578.0
8859-1  156.0

org.apache.harmony.niochar.CharsetProviderimplStd
UTF-8   2610.0
GB18030 3781.0
8859-1  1297.0



On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> sorry I sent previous mail by mistake :(
>
> The result is,
>
> small input data:
> "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> "abcdEfG" for UTF-8 and 8859-1
> encode 1,000,000 times
> =================
> com.ibm.icu4jni.charset.CharsetProiverICU
> UTF-8   8047.0
> GB18030 5203.0
> 8859-1  3468.0
>
> org.apache.harmony.niochar.CharsetProviderimplStd
> UTF-8   1610.0
> GB18030 4328.0
> 8859-1  1282.0
>
> decoding 1,000,000 times
> ==================
> com.ibm.icu4jni.charset.CharsetProiverICU
> UTF-8   5453.0
> GB18030 4703.0
> 8859-1  3407.0
>
> org.apache.harmony.niochar.CharsetProviderimplStd
> UTF-8   1860.0
> GB18030 2671.0
> 8859-1  2234.0
>
> On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > Hi Vladimir,
> >
> > I did some tests on my thinkpad with following env. The reslut is
> > milliseconds, the less the better.
> >
> > win xp sp2
> > 2G RAM
> > Intel Pentium 1.86GHZ
> >
> > small input data:
> > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > "abcdEfG" for UTF-8 and 8859-1
> > encode 1,000,000 times
> >
> > icu
> > intel
> >
> >
> >
> >
> > large input data:
> > 251k for GB18030
> > 133k for UTF-8 and 8859-1
> >
> >
> >
> > com.ibm.icu4jni.charset.CharsetProiverICU
> >
> >
> >
> > org.apache.harmony.niochar.CharsetProviderimplStd
> >
> >
> > On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> > > Vladimir,
> > > I'm writing some testcases for the performance of encoding and
> > > decoding, I'll show you next week.
> > >
> > > On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > Tony,
> > > >
> > > > The reason of failure on you machine was absence of default charset
> > > > for you locale in CharsetProviderImplStd. Updated version of binary
> > > > file include explicit check of charset availability in additional
> > > > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > > > failure after locale change on my machine.
> > > >
> > > > I will merge all new charsets to one provider and update contribution
> > > > package soon.
> > > >
> > > > By the way, have you run any performance measurements with new charset
> > > > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > > > for antlr and xalan Dacapo benchmarks.
> > > >
> > > > ICU impl:
> > > > antlr: 2684 msec
> > > > xalan: 2372 msec
> > > >
> > > > new charsets:
> > > > antlr: 1854 msec
> > > > xalan: 1846 msec
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > And I 'm interested in the difference between the 2 binary versions.
> > > > > Actually I encounter similar problem in my working.
> > > > >
> > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > The new binary version works, Thanks Vladimir.
> > > > > >
> > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > Tony,
> > > > > > >
> > > > > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > > > > you please try it?
> > > > > > >
> > > > > > > Thanks.
> > > > > > > Vladimir.
> > > > > > >
> > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > Hi Vladimir,
> > > > > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > > > > >
> > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > Tony,
> > > > > > > > >
> > > > > > > > > Am I correctly understood that you used binary version?
> > > > > > > > > I used the next steps:
> > > > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > > > 2. ant fetch-depends
> > > > > > > > > 3. ant
> > > > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > > > 6. java Hello
> > > > > > > > > Hello
> > > > > > > > >
> > > > > > > > > Could you please describe the steps you used to get the error?
> > > > > > > > >
> > > > > > > > > Thanks.
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > Hi,
> > > > > > > > > >
> > > > > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > > > > unfortunately I got following error :(
> > > > > > > > > >
> > > > > > > > > > did I miss something?
> > > > > > > > > >
> > > > > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > java/lang/NullPointerException
> > > > > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > > > FAILED to invoke JVM.
> > > > > > > > > >
> > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > Hi Tony,
> > > > > > > > > > >
> > > > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > > > let me know if you will have additional problems.
> > > > > > > > > > >
> > > > > > > > > > > Thanks.
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > >
> > > > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > > > >
> > > > > > > > > > > > trying the binary..
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > > > >
> > > > > > > > > > > > > Tony,
> > > > > > > > > > > > >
> > > > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Leo Li <li...@gmail.com>.
Seems that ICU4JNI has advantage on large data blocks while harmony charset
provider takes the upper hand on small inputs. It is so obvious.:)

How to choose? It is a problem.

ICU4JNI has to pay for JNI calls but it has the advantage of native
language, but I think the advantage is temporary. The cost of JNI is born-in
and not avoidable while there is still space for harmony charset provider to
improve its performance. Furthermore, the advantage of native language
is also controversial, especially comparing with a well-tuned and
fully-trained VM and a classlib with high quality.(This is what we all are
now fighting for.)


On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
>
> sorry I sent previous mail by mistake :(
>
> The result is,
>
> small input data:
> "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> "abcdEfG" for UTF-8 and 8859-1
> encode 1,000,000 times
> =================
> com.ibm.icu4jni.charset.CharsetProiverICU
> UTF-8   8047.0
> GB18030 5203.0
> 8859-1  3468.0
>
> org.apache.harmony.niochar.CharsetProviderimplStd
> UTF-8   1610.0
> GB18030 4328.0
> 8859-1  1282.0
>
> decoding 1,000,000 times
> ==================
> com.ibm.icu4jni.charset.CharsetProiverICU
> UTF-8   5453.0
> GB18030 4703.0
> 8859-1  3407.0
>
> org.apache.harmony.niochar.CharsetProviderimplStd
> UTF-8   1860.0
> GB18030 2671.0
> 8859-1  2234.0
>
> On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> > Hi Vladimir,
> >
> > I did some tests on my thinkpad with following env. The reslut is
> > milliseconds, the less the better.
> >
> > win xp sp2
> > 2G RAM
> > Intel Pentium 1.86GHZ
> >
> > small input data:
> > "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> > "abcdEfG" for UTF-8 and 8859-1
> > encode 1,000,000 times
> >
> > icu
> > intel
> >
> >
> >
> >
> > large input data:
> > 251k for GB18030
> > 133k for UTF-8 and 8859-1
> >
> >
> >
> > com.ibm.icu4jni.charset.CharsetProiverICU
> >
> >
> >
> > org.apache.harmony.niochar.CharsetProviderimplStd
> >
> >
> > On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> > > Vladimir,
> > > I'm writing some testcases for the performance of encoding and
> > > decoding, I'll show you next week.
> > >
> > > On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > Tony,
> > > >
> > > > The reason of failure on you machine was absence of default charset
> > > > for you locale in CharsetProviderImplStd. Updated version of binary
> > > > file include explicit check of charset availability in additional
> > > > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > > > failure after locale change on my machine.
> > > >
> > > > I will merge all new charsets to one provider and update
> contribution
> > > > package soon.
> > > >
> > > > By the way, have you run any performance measurements with new
> charset
> > > > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > > > for antlr and xalan Dacapo benchmarks.
> > > >
> > > > ICU impl:
> > > > antlr: 2684 msec
> > > > xalan: 2372 msec
> > > >
> > > > new charsets:
> > > > antlr: 1854 msec
> > > > xalan: 1846 msec
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > And I 'm interested in the difference between the 2 binary
> versions.
> > > > > Actually I encounter similar problem in my working.
> > > > >
> > > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > The new binary version works, Thanks Vladimir.
> > > > > >
> > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > Tony,
> > > > > > >
> > > > > > > I've attached new binary version of nio_char.jar to JIRA
> issue. Could
> > > > > > > you please try it?
> > > > > > >
> > > > > > > Thanks.
> > > > > > > Vladimir.
> > > > > > >
> > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > Hi Vladimir,
> > > > > > > > I did exactly as you said. I think probably because we uses
> different vm?
> > > > > > > >
> > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > Tony,
> > > > > > > > >
> > > > > > > > > Am I correctly understood that you used binary version?
> > > > > > > > > I used the next steps:
> > > > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > > > 2. ant fetch-depends
> > > > > > > > > 3. ant
> > > > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > > > 6. java Hello
> > > > > > > > > Hello
> > > > > > > > >
> > > > > > > > > Could you please describe the steps you used to get the
> error?
> > > > > > > > >
> > > > > > > > > Thanks.
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > Hi,
> > > > > > > > > >
> > > > > > > > > > I just put the dll to jre/bin and replace the jar at
> /lib/boot.
> > > > > > > > > > unfortunately I got following error :(
> > > > > > > > > >
> > > > > > > > > > did I miss something?
> > > > > > > > > >
> > > > > > > > > > Exception in thread "main"
> java/lang/ExceptionInInitializerError
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (
> J9VMInternals.java:195)
> > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native
> Method)
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (
> J9VMInternals.java:177)
> > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (
> ClassLoader.java:68)
> > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > java/lang/NullPointerException
> > > > > > > > > >        at java/nio/charset/Charset.forNameInternal (
> Charset.java:507)
> > > > > > > > > >        at java/nio/charset/Charset.isSupported (
> Charset.java:578)
> > > > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (
> String.java:68)
> > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native
> Method)
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (
> J9VMInternals.java:177)
> > > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native
> Method)
> > > > > > > > > >        at java/lang/J9VMInternals.initialize (
> J9VMInternals.java:177)
> > > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (
> ClassLoader.java:68)
> > > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > > JVMJ9VM015W Initialization error for library
> jclclear_23(14):
> > > > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > > > FAILED to invoke JVM.
> > > > > > > > > >
> > > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > Hi Tony,
> > > > > > > > > > >
> > > > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > > > let me know if you will have additional problems.
> > > > > > > > > > >
> > > > > > > > > > > Thanks.
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > >
> > > > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h
> files found in
> > > > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > > > >
> > > > > > > > > > > > trying the binary..
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com>
> wrote:
> > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > > > >
> > > > > > > > > > > > > Tony,
> > > > > > > > > > > > >
> > > > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Thanks for explanation, I'm testing it on win
> xp.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > I encounter a minor problem when try to build
> native code.
> > > > > > > > > > > > > > >
> ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > > > instead of .../src/native/niochar/shared
> according to readme.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <
> vstrigun@gmail.com> wrote:
> > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com>
> wrote:
> > > > > > > > > > > > > > > > > I wonder if it is possible to make it as
> built-in charset provider and
> > > > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Attached test bundle, instruction and patch
> for current code combined
> > > > > > > > > > > > > > > > new implementation with ICU. So, I have the
> same 228 charsets
> > > > > > > > > > > > > > > > available - about 90 charsets used from the
> new bundle, and not
> > > > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The full list of charsets supported with the
> current bundle + ICU:
> > > > > > > > > > > > > > > > Adobe-Standard-Encoding  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > Big5     class=
> org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > > > Big5-HKSCS       class=
> org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > > > BOCU-1   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > CESU-8   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > cp850    class=
> org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > > > cp851    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > cp856    class=
> org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > > > cp857    class=
> org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > > > cp858    class=
> org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > > > cp860    class=
> org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > > > cp861    class=
> org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > > > cp862    class=
> org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > > > cp863    class=
> org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > > > cp864    class=
> org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > > > cp865    class=
> org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > > > cp866    class=
> org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > > > cp868    class=
> org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > > > cp869    class=
> org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > > > cp922    class=
> org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > > > EUC-JP   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > EUC-KR   class=
> org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > > > GB18030  class=
> org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > > > GB2312   class=
> org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > > > GB_2312-80       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > GBK      class=
> org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > > > hp-roman8        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > HZ-GB-2312       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > IBM-Thai         class=
> org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > > > IBM01140         class=
> org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > > > IBM01141         class=
> org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > > > IBM01142         class=
> org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > > > IBM01143         class=
> org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > > > IBM01144         class=
> org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > > > IBM01145         class=
> org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > > > IBM01146         class=
> org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > > > IBM01147         class=
> org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > > > IBM01148         class=
> org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > > > IBM01149         class=
> org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > > > IBM037   class=
> org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > > > IBM1026  class=
> org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > > > IBM1047  class=
> org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > > > IBM273   class=
> org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > > > IBM277   class=
> org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > > > IBM278   class=
> org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > > > IBM280   class=
> org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > > > IBM284   class=
> org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > > > IBM285   class=
> org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > > > IBM290   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > IBM297   class=
> org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > > > IBM367   class=
> org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > IBM420   class=
> org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > > > IBM424   class=
> org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > > > IBM437   class=
> org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > > > IBM500   class=
> org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > > > IBM775   class=
> org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > > > IBM852   class=
> org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > > > IBM855   class=
> org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > > > IBM870   class=
> org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > > > IBM871   class=
> org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > > > IBM918   class=
> org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > > > ISO-2022-CN      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-CN-EXT  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-JP      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-JP-2    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-2022-KR      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > ISO-8859-1       class=
> org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > > > ISO-8859-13      class=
> org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > > > ISO-8859-15      class=
> org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > > > ISO-8859-2       class=
> org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > > > ISO-8859-3       class=
> org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > > > ISO-8859-4       class=
> org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > > > ISO-8859-5       class=
> org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > > > ISO-8859-6       class=
> org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > > > ISO-8859-7       class=
> org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > > > ISO-8859-8       class=
> org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > > > ISO-8859-9       class=
> org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > > > JIS_Encoding     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > JIS_X0201        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > KOI8-R   class=
> org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > > > KOI8-U   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > KSC_5601         class=
> org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > > > macintosh        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > SCSU     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > Shift_JIS        class=
> org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > > > TIS-620  class=
> org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > > > US-ASCII         class=
> org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > > UTF-16   class=
> org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > > > UTF-16BE         class=
> org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > > > UTF-16LE         class=
> org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > > > UTF-32   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-32BE         class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-32LE         class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-7    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > UTF-8    class=
> org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > > > windows-1250     class=
> org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > > > windows-1251     class=
> org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > > > windows-1252     class=
> org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > > > windows-1253     class=
> org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > > > windows-1254     class=
> org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > > > windows-1255     class=
> org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > > > windows-1256     class=
> org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > > > windows-1257     class=
> org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > > > windows-1258     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ebcdic-xml-us  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > > > x-ibm-1047-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1140-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1142-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1143-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1144-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1145-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1146-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1147-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1148-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1149-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1153-s390  class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-12712-s390         class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-16804-s390         class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-25546      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-37-s390    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > > > x-ibm-803_P100-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-813_P100-1995      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-867_P100-1998      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > > > x-ibm-901_P100-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-902_P100-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-930_P120-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-933_P110-1995      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-935_P110-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-937_P110-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-939_P120-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-943_P130-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-949_P110-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-950_P110-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-954_P101-2000      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-964_P110-1999      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-ibm-971_P100-1995      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-IMAP-mailbox-name      class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-be       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-de       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-gu       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-ka       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-ma       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-or       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-pa       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-ta       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-iscii-te       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-JIS7   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-JIS8   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-1        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-11       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-16       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-17       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-18       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-19       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-2        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-3        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-4        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-5        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-6        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-LMBCS-8        class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-mac-centraleurroman    class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > > > class=
> org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > > > x-mac-greek      class=
> org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > > > x-mac-turkish    class=
> org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-windows-874-2000       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > > x-windows-949-2000       class=
> com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com>
> wrote:
> > > > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <
> vstrigun@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <
> zhanghuangzhu@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <
> vstrigun@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <
> zhanghuangzhu@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent
> on ICU any more?
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Unfortunately not all charsets
> supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > > > list of supported charsets you
> could find in README file.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at
> all. :)
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > We're on the way to be independent
> of ICU, right? ;)
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <
> vstrigun@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more
> contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation
> of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > > > intended to replace the
> ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > > > operations. The code was
> developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play
> with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > The package could be found
> there:
> > > > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > The algorithms for charsets
> encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > > > ICU, all charsets are
> generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > > > implementation of Java and
> could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > > > nio_char module. The archive
> contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1,
> UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > > > implementation of
> CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > > > native part. I've tested the
> package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > > > all benchmarks and tests
> passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > > > significant boost for
> Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM
> and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > The main advantages of the
> package are the following:
> > > > > > > > > > > > > > > > > > > > > > >   - Code for every charset is
> generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > > > some modification would be
> necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > > > >   - We use 2 different
> encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > > > buffers. Since most
> applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > > > existing implementation it
> doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > > > perform encoding/decoding
> operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > > > significantly improving
> performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > > > have such a significant boost
> for Dacapo.
> > > > > > > > > > > > > > > > > > > > > > >   - Charset tables for
> encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Since the package contains
> implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > > > documentations how to generate
> and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > > > find in README file from
> contributed package.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Please do not hesitate to
> contact me for more details.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>



-- 
Leo Li
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
sorry I sent previous mail by mistake :(

The result is,

small input data:
"\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
"abcdEfG" for UTF-8 and 8859-1
encode 1,000,000 times
=================
com.ibm.icu4jni.charset.CharsetProiverICU
UTF-8   8047.0
GB18030 5203.0
8859-1  3468.0

org.apache.harmony.niochar.CharsetProviderimplStd
UTF-8   1610.0
GB18030 4328.0
8859-1  1282.0

decoding 1,000,000 times
==================
com.ibm.icu4jni.charset.CharsetProiverICU
UTF-8   5453.0
GB18030 4703.0
8859-1  3407.0

org.apache.harmony.niochar.CharsetProviderimplStd
UTF-8   1860.0
GB18030 2671.0
8859-1  2234.0

On 4/19/07, Tony Wu <wu...@gmail.com> wrote:
> Hi Vladimir,
>
> I did some tests on my thinkpad with following env. The reslut is
> milliseconds, the less the better.
>
> win xp sp2
> 2G RAM
> Intel Pentium 1.86GHZ
>
> small input data:
> "\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
> "abcdEfG" for UTF-8 and 8859-1
> encode 1,000,000 times
>
> icu
> intel
>
>
>
>
> large input data:
> 251k for GB18030
> 133k for UTF-8 and 8859-1
>
>
>
> com.ibm.icu4jni.charset.CharsetProiverICU
>
>
>
> org.apache.harmony.niochar.CharsetProviderimplStd
>
>
> On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> > Vladimir,
> > I'm writing some testcases for the performance of encoding and
> > decoding, I'll show you next week.
> >
> > On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > Tony,
> > >
> > > The reason of failure on you machine was absence of default charset
> > > for you locale in CharsetProviderImplStd. Updated version of binary
> > > file include explicit check of charset availability in additional
> > > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > > failure after locale change on my machine.
> > >
> > > I will merge all new charsets to one provider and update contribution
> > > package soon.
> > >
> > > By the way, have you run any performance measurements with new charset
> > > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > > for antlr and xalan Dacapo benchmarks.
> > >
> > > ICU impl:
> > > antlr: 2684 msec
> > > xalan: 2372 msec
> > >
> > > new charsets:
> > > antlr: 1854 msec
> > > xalan: 1846 msec
> > >
> > > Thanks.
> > > Vladimir.
> > >
> > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > And I 'm interested in the difference between the 2 binary versions.
> > > > Actually I encounter similar problem in my working.
> > > >
> > > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > The new binary version works, Thanks Vladimir.
> > > > >
> > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > Tony,
> > > > > >
> > > > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > > > you please try it?
> > > > > >
> > > > > > Thanks.
> > > > > > Vladimir.
> > > > > >
> > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > Hi Vladimir,
> > > > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > > > >
> > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > Tony,
> > > > > > > >
> > > > > > > > Am I correctly understood that you used binary version?
> > > > > > > > I used the next steps:
> > > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > > 2. ant fetch-depends
> > > > > > > > 3. ant
> > > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > > 6. java Hello
> > > > > > > > Hello
> > > > > > > >
> > > > > > > > Could you please describe the steps you used to get the error?
> > > > > > > >
> > > > > > > > Thanks.
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > Hi,
> > > > > > > > >
> > > > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > > > unfortunately I got following error :(
> > > > > > > > >
> > > > > > > > > did I miss something?
> > > > > > > > >
> > > > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > java/lang/NullPointerException
> > > > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > > FAILED to invoke JVM.
> > > > > > > > >
> > > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > Hi Tony,
> > > > > > > > > >
> > > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > > let me know if you will have additional problems.
> > > > > > > > > >
> > > > > > > > > > Thanks.
> > > > > > > > > > Vladimir.
> > > > > > > > > >
> > > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > >
> > > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > > >
> > > > > > > > > > > trying the binary..
> > > > > > > > > > >
> > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > > >
> > > > > > > > > > > > Tony,
> > > > > > > > > > > >
> > > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks.
> > > > > > > > > > > > Vladimir.
> > > > > > > > > > > >
> > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Hi Vladimir,

I did some tests on my thinkpad with following env. The reslut is
milliseconds, the less the better.

win xp sp2
2G RAM
Intel Pentium 1.86GHZ

small input data:
"\u662f\u975e\u6210\u8d25\u8f6c\u5934\u7a7a" for GB18030
"abcdEfG" for UTF-8 and 8859-1
encode 1,000,000 times

icu
intel




large input data:
251k for GB18030
133k for UTF-8 and 8859-1



com.ibm.icu4jni.charset.CharsetProiverICU



org.apache.harmony.niochar.CharsetProviderimplStd


On 4/14/07, Tony Wu <wu...@gmail.com> wrote:
> Vladimir,
> I'm writing some testcases for the performance of encoding and
> decoding, I'll show you next week.
>
> On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > Tony,
> >
> > The reason of failure on you machine was absence of default charset
> > for you locale in CharsetProviderImplStd. Updated version of binary
> > file include explicit check of charset availability in additional
> > provider: CharsetProviderImplExt. I'm also able to reproduce you
> > failure after locale change on my machine.
> >
> > I will merge all new charsets to one provider and update contribution
> > package soon.
> >
> > By the way, have you run any performance measurements with new charset
> > impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> > for antlr and xalan Dacapo benchmarks.
> >
> > ICU impl:
> > antlr: 2684 msec
> > xalan: 2372 msec
> >
> > new charsets:
> > antlr: 1854 msec
> > xalan: 1846 msec
> >
> > Thanks.
> > Vladimir.
> >
> > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > And I 'm interested in the difference between the 2 binary versions.
> > > Actually I encounter similar problem in my working.
> > >
> > > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > > The new binary version works, Thanks Vladimir.
> > > >
> > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > Tony,
> > > > >
> > > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > > you please try it?
> > > > >
> > > > > Thanks.
> > > > > Vladimir.
> > > > >
> > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > Hi Vladimir,
> > > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > > >
> > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > Tony,
> > > > > > >
> > > > > > > Am I correctly understood that you used binary version?
> > > > > > > I used the next steps:
> > > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > > 2. ant fetch-depends
> > > > > > > 3. ant
> > > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > > 6. java Hello
> > > > > > > Hello
> > > > > > >
> > > > > > > Could you please describe the steps you used to get the error?
> > > > > > >
> > > > > > > Thanks.
> > > > > > > Vladimir.
> > > > > > >
> > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > Hi,
> > > > > > > >
> > > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > > unfortunately I got following error :(
> > > > > > > >
> > > > > > > > did I miss something?
> > > > > > > >
> > > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > java/lang/NullPointerException
> > > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > > FAILED to invoke JVM.
> > > > > > > >
> > > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > Hi Tony,
> > > > > > > > >
> > > > > > > > > header files could be generated by javac tool :)
> > > > > > > > > let me know if you will have additional problems.
> > > > > > > > >
> > > > > > > > > Thanks.
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > Hi Vladimir,
> > > > > > > > > >
> > > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > > your patch, still I can not build from src.
> > > > > > > > > >
> > > > > > > > > > trying the binary..
> > > > > > > > > >
> > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > > >
> > > > > > > > > > > Tony,
> > > > > > > > > > >
> > > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > > >
> > > > > > > > > > > Thanks.
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > > >
> > > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > > >
> > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Vladimir,
I'm writing some testcases for the performance of encoding and
decoding, I'll show you next week.

On 4/12/07, Vladimir Strigun <vs...@gmail.com> wrote:
> Tony,
>
> The reason of failure on you machine was absence of default charset
> for you locale in CharsetProviderImplStd. Updated version of binary
> file include explicit check of charset availability in additional
> provider: CharsetProviderImplExt. I'm also able to reproduce you
> failure after locale change on my machine.
>
> I will merge all new charsets to one provider and update contribution
> package soon.
>
> By the way, have you run any performance measurements with new charset
> impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
> for antlr and xalan Dacapo benchmarks.
>
> ICU impl:
> antlr: 2684 msec
> xalan: 2372 msec
>
> new charsets:
> antlr: 1854 msec
> xalan: 1846 msec
>
> Thanks.
> Vladimir.
>
> On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > And I 'm interested in the difference between the 2 binary versions.
> > Actually I encounter similar problem in my working.
> >
> > On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > > The new binary version works, Thanks Vladimir.
> > >
> > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > Tony,
> > > >
> > > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > > you please try it?
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > Hi Vladimir,
> > > > > I did exactly as you said. I think probably because we uses different vm?
> > > > >
> > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > Tony,
> > > > > >
> > > > > > Am I correctly understood that you used binary version?
> > > > > > I used the next steps:
> > > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > > 2. ant fetch-depends
> > > > > > 3. ant
> > > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > > 6. java Hello
> > > > > > Hello
> > > > > >
> > > > > > Could you please describe the steps you used to get the error?
> > > > > >
> > > > > > Thanks.
> > > > > > Vladimir.
> > > > > >
> > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > Hi,
> > > > > > >
> > > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > > unfortunately I got following error :(
> > > > > > >
> > > > > > > did I miss something?
> > > > > > >
> > > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > java/lang/NullPointerException
> > > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > > FAILED to invoke JVM.
> > > > > > >
> > > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > Hi Tony,
> > > > > > > >
> > > > > > > > header files could be generated by javac tool :)
> > > > > > > > let me know if you will have additional problems.
> > > > > > > >
> > > > > > > > Thanks.
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > Hi Vladimir,
> > > > > > > > >
> > > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > > your patch, still I can not build from src.
> > > > > > > > >
> > > > > > > > > trying the binary..
> > > > > > > > >
> > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > > >
> > > > > > > > > > Tony,
> > > > > > > > > >
> > > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > > >
> > > > > > > > > > Thanks.
> > > > > > > > > > Vladimir.
> > > > > > > > > >
> > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > > >
> > > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > > >
> > > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > > >
> > > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
Tony,

The reason of failure on you machine was absence of default charset
for you locale in CharsetProviderImplStd. Updated version of binary
file include explicit check of charset availability in additional
provider: CharsetProviderImplExt. I'm also able to reproduce you
failure after locale change on my machine.

I will merge all new charsets to one provider and update contribution
package soon.

By the way, have you run any performance measurements with new charset
impl on J9 machine? My measurements on Woodcrest IA32 shows speedup
for antlr and xalan Dacapo benchmarks.

ICU impl:
antlr: 2684 msec
xalan: 2372 msec

new charsets:
antlr: 1854 msec
xalan: 1846 msec

Thanks.
Vladimir.

On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> And I 'm interested in the difference between the 2 binary versions.
> Actually I encounter similar problem in my working.
>
> On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> > The new binary version works, Thanks Vladimir.
> >
> > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > Tony,
> > >
> > > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > > you please try it?
> > >
> > > Thanks.
> > > Vladimir.
> > >
> > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > Hi Vladimir,
> > > > I did exactly as you said. I think probably because we uses different vm?
> > > >
> > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > Tony,
> > > > >
> > > > > Am I correctly understood that you used binary version?
> > > > > I used the next steps:
> > > > > 1. download harmony classlib/trunk (rev 527065)
> > > > > 2. ant fetch-depends
> > > > > 3. ant
> > > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > > 6. java Hello
> > > > > Hello
> > > > >
> > > > > Could you please describe the steps you used to get the error?
> > > > >
> > > > > Thanks.
> > > > > Vladimir.
> > > > >
> > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > Hi,
> > > > > >
> > > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > > unfortunately I got following error :(
> > > > > >
> > > > > > did I miss something?
> > > > > >
> > > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > java/lang/NullPointerException
> > > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > > FAILED to invoke JVM.
> > > > > >
> > > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > Hi Tony,
> > > > > > >
> > > > > > > header files could be generated by javac tool :)
> > > > > > > let me know if you will have additional problems.
> > > > > > >
> > > > > > > Thanks.
> > > > > > > Vladimir.
> > > > > > >
> > > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > Hi Vladimir,
> > > > > > > >
> > > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > > your patch, still I can not build from src.
> > > > > > > >
> > > > > > > > trying the binary..
> > > > > > > >
> > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > And I have not found a makefile for native?
> > > > > > > > >
> > > > > > > > > Tony,
> > > > > > > > >
> > > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > > >
> > > > > > > > > Thanks.
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > Hi Vladimir,
> > > > > > > > > > >
> > > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > > >
> > > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > > >
> > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > > >
> > > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > > >
> > > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks.
> > > > > > > > > > > > Vladimir.
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
And I 'm interested in the difference between the 2 binary versions.
Actually I encounter similar problem in my working.

On 4/12/07, Tony Wu <wu...@gmail.com> wrote:
> The new binary version works, Thanks Vladimir.
>
> On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > Tony,
> >
> > I've attached new binary version of nio_char.jar to JIRA issue. Could
> > you please try it?
> >
> > Thanks.
> > Vladimir.
> >
> > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > Hi Vladimir,
> > > I did exactly as you said. I think probably because we uses different vm?
> > >
> > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > Tony,
> > > >
> > > > Am I correctly understood that you used binary version?
> > > > I used the next steps:
> > > > 1. download harmony classlib/trunk (rev 527065)
> > > > 2. ant fetch-depends
> > > > 3. ant
> > > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > > 6. java Hello
> > > > Hello
> > > >
> > > > Could you please describe the steps you used to get the error?
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > Hi,
> > > > >
> > > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > > unfortunately I got following error :(
> > > > >
> > > > > did I miss something?
> > > > >
> > > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > java/lang/NullPointerException
> > > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > >        at java/lang/System.<clinit> (System.java:110)
> > > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > > JVMJ9VM009E J9VMDllMain failed
> > > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > > FAILED to invoke JVM.
> > > > >
> > > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > Hi Tony,
> > > > > >
> > > > > > header files could be generated by javac tool :)
> > > > > > let me know if you will have additional problems.
> > > > > >
> > > > > > Thanks.
> > > > > > Vladimir.
> > > > > >
> > > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > Hi Vladimir,
> > > > > > >
> > > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > > your patch, still I can not build from src.
> > > > > > >
> > > > > > > trying the binary..
> > > > > > >
> > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > And I have not found a makefile for native?
> > > > > > > >
> > > > > > > > Tony,
> > > > > > > >
> > > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > > >
> > > > > > > > Thanks.
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > Hi Vladimir,
> > > > > > > > > >
> > > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > > >
> > > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > > >
> > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > > make icu as an extension?
> > > > > > > > > > >
> > > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > > >
> > > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > Thanks.
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > > amazing work.
> > > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > > >
> > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
The new binary version works, Thanks Vladimir.

On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> Tony,
>
> I've attached new binary version of nio_char.jar to JIRA issue. Could
> you please try it?
>
> Thanks.
> Vladimir.
>
> On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > Hi Vladimir,
> > I did exactly as you said. I think probably because we uses different vm?
> >
> > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > Tony,
> > >
> > > Am I correctly understood that you used binary version?
> > > I used the next steps:
> > > 1. download harmony classlib/trunk (rev 527065)
> > > 2. ant fetch-depends
> > > 3. ant
> > > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > > 5. copy hycharset.dll attached to JIRA to jre/bin
> > > 6. java Hello
> > > Hello
> > >
> > > Could you please describe the steps you used to get the error?
> > >
> > > Thanks.
> > > Vladimir.
> > >
> > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > Hi,
> > > >
> > > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > > unfortunately I got following error :(
> > > >
> > > > did I miss something?
> > > >
> > > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > > >        at java/lang/System.<clinit> (System.java:110)
> > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > java/lang/NullPointerException
> > > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > >        at java/lang/System.<clinit> (System.java:110)
> > > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > > >        at java/lang/Thread.initialize (Thread.java:339)
> > > >        at java/lang/Thread.<init> (Thread.java:126)
> > > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > > JVMJ9VM009E J9VMDllMain failed
> > > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > > FAILED to invoke JVM.
> > > >
> > > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > Hi Tony,
> > > > >
> > > > > header files could be generated by javac tool :)
> > > > > let me know if you will have additional problems.
> > > > >
> > > > > Thanks.
> > > > > Vladimir.
> > > > >
> > > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > Hi Vladimir,
> > > > > >
> > > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > > your patch, still I can not build from src.
> > > > > >
> > > > > > trying the binary..
> > > > > >
> > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > And I have not found a makefile for native?
> > > > > > >
> > > > > > > Tony,
> > > > > > >
> > > > > > > I've attached makefiles to the JIRA issue.
> > > > > > >
> > > > > > > Thanks.
> > > > > > > Vladimir.
> > > > > > >
> > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > Hi Vladimir,
> > > > > > > > >
> > > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > > >
> > > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > > >
> > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > > make icu as an extension?
> > > > > > > > > >
> > > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > > implemented charsets used from ICU.
> > > > > > > > > >
> > > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-mac-cyrillic
> > > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > Thanks.
> > > > > > > > > > Vladimir.
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > > amazing work.
> > > > > > > > > > > > generating the charsets...
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > > >
> > > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Tony Wu
> > > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
Tony,

I've attached new binary version of nio_char.jar to JIRA issue. Could
you please try it?

Thanks.
Vladimir.

On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> Hi Vladimir,
> I did exactly as you said. I think probably because we uses different vm?
>
> On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > Tony,
> >
> > Am I correctly understood that you used binary version?
> > I used the next steps:
> > 1. download harmony classlib/trunk (rev 527065)
> > 2. ant fetch-depends
> > 3. ant
> > 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> > 5. copy hycharset.dll attached to JIRA to jre/bin
> > 6. java Hello
> > Hello
> >
> > Could you please describe the steps you used to get the error?
> >
> > Thanks.
> > Vladimir.
> >
> > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > Hi,
> > >
> > > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > > unfortunately I got following error :(
> > >
> > > did I miss something?
> > >
> > > Exception in thread "main" java/lang/ExceptionInInitializerError
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> > >        at java/lang/System.<clinit> (System.java:110)
> > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > >        at java/lang/Thread.initialize (Thread.java:339)
> > >        at java/lang/Thread.<init> (Thread.java:126)
> > > java/lang/NullPointerException
> > >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> > >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> > >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > >        at java/lang/System.<clinit> (System.java:110)
> > >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> > >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> > >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> > >        at java/lang/Thread.initialize (Thread.java:339)
> > >        at java/lang/Thread.<init> (Thread.java:126)
> > > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > > JVMJ9VM009E J9VMDllMain failed
> > > HMYEXEL062E Internal VM error: Failed to create Java VM
> > > FAILED to invoke JVM.
> > >
> > > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > Hi Tony,
> > > >
> > > > header files could be generated by javac tool :)
> > > > let me know if you will have additional problems.
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > Hi Vladimir,
> > > > >
> > > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > > your patch, still I can not build from src.
> > > > >
> > > > > trying the binary..
> > > > >
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > And I have not found a makefile for native?
> > > > > >
> > > > > > Tony,
> > > > > >
> > > > > > I've attached makefiles to the JIRA issue.
> > > > > >
> > > > > > Thanks.
> > > > > > Vladimir.
> > > > > >
> > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > Hi Vladimir,
> > > > > > > >
> > > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > > >
> > > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > > >
> > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > > make icu as an extension?
> > > > > > > > >
> > > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > > implemented charsets used from ICU.
> > > > > > > > >
> > > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-737_P100-1997
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-875_P100-1995
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-mac-cyrillic
> > > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Thanks.
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > > amazing work.
> > > > > > > > > > > generating the charsets...
> > > > > > > > > > >
> > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > > >
> > > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > > >
> > > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > --
> > > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Tony Wu
> > > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Hi Vladimir,
I did exactly as you said. I think probably because we uses different vm?

On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> Tony,
>
> Am I correctly understood that you used binary version?
> I used the next steps:
> 1. download harmony classlib/trunk (rev 527065)
> 2. ant fetch-depends
> 3. ant
> 4. Copy nio_char.jar attached to JIRA to jre/lib/boot
> 5. copy hycharset.dll attached to JIRA to jre/bin
> 6. java Hello
> Hello
>
> Could you please describe the steps you used to get the error?
>
> Thanks.
> Vladimir.
>
> On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > Hi,
> >
> > I just put the dll to jre/bin and replace the jar at /lib/boot.
> > unfortunately I got following error :(
> >
> > did I miss something?
> >
> > Exception in thread "main" java/lang/ExceptionInInitializerError
> >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
> >        at java/lang/System.<clinit> (System.java:110)
> >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> >        at java/lang/Thread.initialize (Thread.java:339)
> >        at java/lang/Thread.<init> (Thread.java:126)
> > java/lang/NullPointerException
> >        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
> >        at java/nio/charset/Charset.isSupported (Charset.java:578)
> >        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
> >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> >        at java/lang/System.<clinit> (System.java:110)
> >        at java/lang/J9VMInternals.initializeImpl (Native Method)
> >        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
> >        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
> >        at java/lang/Thread.initialize (Thread.java:339)
> >        at java/lang/Thread.<init> (Thread.java:126)
> > JVMJ9VM015W Initialization error for library jclclear_23(14):
> > JVMJ9VM009E J9VMDllMain failed
> > HMYEXEL062E Internal VM error: Failed to create Java VM
> > FAILED to invoke JVM.
> >
> > On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > Hi Tony,
> > >
> > > header files could be generated by javac tool :)
> > > let me know if you will have additional problems.
> > >
> > > Thanks.
> > > Vladimir.
> > >
> > > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > > Hi Vladimir,
> > > >
> > > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > > your patch, still I can not build from src.
> > > >
> > > > trying the binary..
> > > >
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > And I have not found a makefile for native?
> > > > >
> > > > > Tony,
> > > > >
> > > > > I've attached makefiles to the JIRA issue.
> > > > >
> > > > > Thanks.
> > > > > Vladimir.
> > > > >
> > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > Hi Vladimir,
> > > > > > >
> > > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > > >
> > > > > > > I encounter a minor problem when try to build native code.
> > > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > > >
> > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > > make icu as an extension?
> > > > > > > >
> > > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > > implemented charsets used from ICU.
> > > > > > > >
> > > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1006_P100-1995
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > > x-ibm-1025_P100-1995
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1097_P100-1995
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > > x-ibm-1098_P100-1995
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > > x-ibm-1112_P100-1995
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > > x-ibm-1122_P100-1999
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > > x-ibm-1123_P100-1995
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > > x-ibm-1124_P100-1996
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1375_P100-2003
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-737_P100-1997
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-875_P100-1995
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-mac-cyrillic
> > > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > >
> > > > > > > >
> > > > > > > > Thanks.
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > > >
> > > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > > amazing work.
> > > > > > > > > > generating the charsets...
> > > > > > > > > >
> > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > >
> > > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > > >
> > > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > > >
> > > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > > >
> > > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Best regards,
> > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Tony Wu
> > > > > > > > > > China Software Development Lab, IBM
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
Tony,

Am I correctly understood that you used binary version?
I used the next steps:
1. download harmony classlib/trunk (rev 527065)
2. ant fetch-depends
3. ant
4. Copy nio_char.jar attached to JIRA to jre/lib/boot
5. copy hycharset.dll attached to JIRA to jre/bin
6. java Hello
Hello

Could you please describe the steps you used to get the error?

Thanks.
Vladimir.

On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> Hi,
>
> I just put the dll to jre/bin and replace the jar at /lib/boot.
> unfortunately I got following error :(
>
> did I miss something?
>
> Exception in thread "main" java/lang/ExceptionInInitializerError
>        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
>        at java/lang/System.<clinit> (System.java:110)
>        at java/lang/J9VMInternals.initializeImpl (Native Method)
>        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
>        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
>        at java/lang/Thread.initialize (Thread.java:339)
>        at java/lang/Thread.<init> (Thread.java:126)
> java/lang/NullPointerException
>        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
>        at java/nio/charset/Charset.isSupported (Charset.java:578)
>        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
>        at java/lang/J9VMInternals.initializeImpl (Native Method)
>        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
>        at java/lang/System.<clinit> (System.java:110)
>        at java/lang/J9VMInternals.initializeImpl (Native Method)
>        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
>        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
>        at java/lang/Thread.initialize (Thread.java:339)
>        at java/lang/Thread.<init> (Thread.java:126)
> JVMJ9VM015W Initialization error for library jclclear_23(14):
> JVMJ9VM009E J9VMDllMain failed
> HMYEXEL062E Internal VM error: Failed to create Java VM
> FAILED to invoke JVM.
>
> On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > Hi Tony,
> >
> > header files could be generated by javac tool :)
> > let me know if you will have additional problems.
> >
> > Thanks.
> > Vladimir.
> >
> > On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > > Hi Vladimir,
> > >
> > > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > > your patch, still I can not build from src.
> > >
> > > trying the binary..
> > >
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > And I have not found a makefile for native?
> > > >
> > > > Tony,
> > > >
> > > > I've attached makefiles to the JIRA issue.
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > Hi Vladimir,
> > > > > >
> > > > > > Thanks for explanation, I'm testing it on win xp.
> > > > > >
> > > > > > I encounter a minor problem when try to build native code.
> > > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > > instead of .../src/native/niochar/shared according to readme.
> > > > > >
> > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > > make icu as an extension?
> > > > > > >
> > > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > > implemented charsets used from ICU.
> > > > > > >
> > > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1006_P100-1995
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > > x-ibm-1025_P100-1995
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1097_P100-1995
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > > x-ibm-1098_P100-1995
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > > x-ibm-1112_P100-1995
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > > x-ibm-1122_P100-1999
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > > x-ibm-1123_P100-1995
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > > x-ibm-1124_P100-1996
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1375_P100-2003
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-737_P100-1997
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-875_P100-1995
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-mac-cyrillic
> > > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > >
> > > > > > >
> > > > > > > Thanks.
> > > > > > > Vladimir.
> > > > > > >
> > > > > > >
> > > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > > amazing work.
> > > > > > > > > generating the charsets...
> > > > > > > > >
> > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > > >
> > > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > > >
> > > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > > >
> > > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > > tree.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > > classes.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > --
> > > > > > > > > > > > > Best regards,
> > > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Best regards,
> > > > > > > > > > > Andrew Zhang
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Tony Wu
> > > > > > > > > China Software Development Lab, IBM
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Hi,

I just put the dll to jre/bin and replace the jar at /lib/boot.
unfortunately I got following error :(

did I miss something?

Exception in thread "main" java/lang/ExceptionInInitializerError
        at java/lang/J9VMInternals.initialize (J9VMInternals.java:195)
        at java/lang/System.<clinit> (System.java:110)
        at java/lang/J9VMInternals.initializeImpl (Native Method)
        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
        at java/lang/Thread.initialize (Thread.java:339)
        at java/lang/Thread.<init> (Thread.java:126)
java/lang/NullPointerException
        at java/nio/charset/Charset.forNameInternal (Charset.java:507)
        at java/nio/charset/Charset.isSupported (Charset.java:578)
        at java/lang/String$ConsolePrintStream.<clinit> (String.java:68)
        at java/lang/J9VMInternals.initializeImpl (Native Method)
        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
        at java/lang/System.<clinit> (System.java:110)
        at java/lang/J9VMInternals.initializeImpl (Native Method)
        at java/lang/J9VMInternals.initialize (J9VMInternals.java:177)
        at java/lang/ClassLoader.initializeClassLoaders (ClassLoader.java:68)
        at java/lang/Thread.initialize (Thread.java:339)
        at java/lang/Thread.<init> (Thread.java:126)
JVMJ9VM015W Initialization error for library jclclear_23(14):
JVMJ9VM009E J9VMDllMain failed
HMYEXEL062E Internal VM error: Failed to create Java VM
FAILED to invoke JVM.

On 4/10/07, Vladimir Strigun <vs...@gmail.com> wrote:
> Hi Tony,
>
> header files could be generated by javac tool :)
> let me know if you will have additional problems.
>
> Thanks.
> Vladimir.
>
> On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> > Hi Vladimir,
> >
> > I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> > your patch, still I can not build from src.
> >
> > trying the binary..
> >
> > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > And I have not found a makefile for native?
> > >
> > > Tony,
> > >
> > > I've attached makefiles to the JIRA issue.
> > >
> > > Thanks.
> > > Vladimir.
> > >
> > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > Hi Vladimir,
> > > > >
> > > > > Thanks for explanation, I'm testing it on win xp.
> > > > >
> > > > > I encounter a minor problem when try to build native code.
> > > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > > instead of .../src/native/niochar/shared according to readme.
> > > > >
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > > make icu as an extension?
> > > > > >
> > > > > > Attached test bundle, instruction and patch for current code combined
> > > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > > available - about 90 charsets used from the new bundle, and not
> > > > > > implemented charsets used from ICU.
> > > > > >
> > > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1006_P100-1995
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > > x-ibm-1025_P100-1995
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1097_P100-1995
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > > x-ibm-1098_P100-1995
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > > x-ibm-1112_P100-1995
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > > x-ibm-1122_P100-1999
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > > x-ibm-1123_P100-1995
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > > x-ibm-1124_P100-1996
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1375_P100-2003
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-737_P100-1997
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-875_P100-1995
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-mac-cyrillic
> > > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > >
> > > > > >
> > > > > > Thanks.
> > > > > > Vladimir.
> > > > > >
> > > > > >
> > > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > > amazing work.
> > > > > > > > generating the charsets...
> > > > > > > >
> > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > >
> > > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > > Super cool!!!
> > > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > > >
> > > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > > >
> > > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > > >
> > > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > > >
> > > > > > > > > > > > > Hi all!
> > > > > > > > > > > > >
> > > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > > tree.
> > > > > > > > > > > > >
> > > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > > >
> > > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > > >
> > > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > > classes.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > Vladimir.
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > --
> > > > > > > > > > > > Best regards,
> > > > > > > > > > > > Andrew Zhang
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Best regards,
> > > > > > > > > > Andrew Zhang
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Tony Wu
> > > > > > > > China Software Development Lab, IBM
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
Hi Tony,

header files could be generated by javac tool :)
let me know if you will have additional problems.

Thanks.
Vladimir.

On 4/10/07, Tony Wu <wu...@gmail.com> wrote:
> Hi Vladimir,
>
> I found #include "CP_1250.h" in CP_1250.c but no .h files found in
> your patch, still I can not build from src.
>
> trying the binary..
>
> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > And I have not found a makefile for native?
> >
> > Tony,
> >
> > I've attached makefiles to the JIRA issue.
> >
> > Thanks.
> > Vladimir.
> >
> > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > Hi Vladimir,
> > > >
> > > > Thanks for explanation, I'm testing it on win xp.
> > > >
> > > > I encounter a minor problem when try to build native code.
> > > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > > instead of .../src/native/niochar/shared according to readme.
> > > >
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > > make icu as an extension?
> > > > >
> > > > > Attached test bundle, instruction and patch for current code combined
> > > > > new implementation with ICU. So, I have the same 228 charsets
> > > > > available - about 90 charsets used from the new bundle, and not
> > > > > implemented charsets used from ICU.
> > > > >
> > > > > The full list of charsets supported with the current bundle + ICU:
> > > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1006_P100-1995
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > > x-ibm-1025_P100-1995
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1097_P100-1995
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > > x-ibm-1098_P100-1995
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > > x-ibm-1112_P100-1995
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > > x-ibm-1122_P100-1999
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > > x-ibm-1123_P100-1995
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > > x-ibm-1124_P100-1996
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1375_P100-2003
> > > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-737_P100-1997
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-875_P100-1995
> > > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-mac-cyrillic
> > > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > >
> > > > >
> > > > > Thanks.
> > > > > Vladimir.
> > > > >
> > > > >
> > > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > > amazing work.
> > > > > > > generating the charsets...
> > > > > > >
> > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > >
> > > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > > Super cool!!!
> > > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > > >
> > > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > > >
> > > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > > >
> > > > > > > > Yes, you right,  we're on the way :)
> > > > > > > >
> > > > > > > >
> > > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > > >
> > > > > > > > > > > > Hi all!
> > > > > > > > > > > >
> > > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > > tree.
> > > > > > > > > > > >
> > > > > > > > > > > > The package could be found there:
> > > > > > > > > > > > HARMONY-3593
> > > > > > > > > > > >
> > > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > > >
> > > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > > classes.
> > > > > > > > > > > >
> > > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > > >
> > > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks,
> > > > > > > > > > > > Vladimir.
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > --
> > > > > > > > > > > Best regards,
> > > > > > > > > > > Andrew Zhang
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Best regards,
> > > > > > > > > Andrew Zhang
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Tony Wu
> > > > > > > China Software Development Lab, IBM
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Hi Vladimir,

I found #include "CP_1250.h" in CP_1250.c but no .h files found in
your patch, still I can not build from src.

trying the binary..

On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > And I have not found a makefile for native?
>
> Tony,
>
> I've attached makefiles to the JIRA issue.
>
> Thanks.
> Vladimir.
>
> > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > Hi Vladimir,
> > >
> > > Thanks for explanation, I'm testing it on win xp.
> > >
> > > I encounter a minor problem when try to build native code.
> > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > instead of .../src/native/niochar/shared according to readme.
> > >
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > make icu as an extension?
> > > >
> > > > Attached test bundle, instruction and patch for current code combined
> > > > new implementation with ICU. So, I have the same 228 charsets
> > > > available - about 90 charsets used from the new bundle, and not
> > > > implemented charsets used from ICU.
> > > >
> > > > The full list of charsets supported with the current bundle + ICU:
> > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1006_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > x-ibm-1025_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1097_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > x-ibm-1098_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > x-ibm-1112_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > x-ibm-1122_P100-1999
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > x-ibm-1123_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > x-ibm-1124_P100-1996
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1375_P100-2003
> > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-737_P100-1997
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-875_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-mac-cyrillic
> > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > >
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > >
> > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > amazing work.
> > > > > > generating the charsets...
> > > > > >
> > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > >
> > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > Super cool!!!
> > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > >
> > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > >
> > > > > > > >
> > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > >
> > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > >
> > > > > > > Yes, you right,  we're on the way :)
> > > > > > >
> > > > > > >
> > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > >
> > > > > > > > > > > Hi all!
> > > > > > > > > > >
> > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > tree.
> > > > > > > > > > >
> > > > > > > > > > > The package could be found there:
> > > > > > > > > > > HARMONY-3593
> > > > > > > > > > >
> > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > >
> > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > classes.
> > > > > > > > > > >
> > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > >
> > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > >
> > > > > > > > > > > Thanks,
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Best regards,
> > > > > > > > > > Andrew Zhang
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Best regards,
> > > > > > > > Andrew Zhang
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Thanks, I'll try it tomorrow :)

On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > And I have not found a makefile for native?
>
> Tony,
>
> I've attached makefiles to the JIRA issue.
>
> Thanks.
> Vladimir.
>
> > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > Hi Vladimir,
> > >
> > > Thanks for explanation, I'm testing it on win xp.
> > >
> > > I encounter a minor problem when try to build native code.
> > > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > > instead of .../src/native/niochar/shared according to readme.
> > >
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > I wonder if it is possible to make it as built-in charset provider and
> > > > > make icu as an extension?
> > > >
> > > > Attached test bundle, instruction and patch for current code combined
> > > > new implementation with ICU. So, I have the same 228 charsets
> > > > available - about 90 charsets used from the new bundle, and not
> > > > implemented charsets used from ICU.
> > > >
> > > > The full list of charsets supported with the current bundle + ICU:
> > > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1006_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > > x-ibm-1025_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1097_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > > x-ibm-1098_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > > x-ibm-1112_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > > x-ibm-1122_P100-1999
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > > x-ibm-1123_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > > x-ibm-1124_P100-1996
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1375_P100-2003
> > > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-737_P100-1997
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-875_P100-1995
> > > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-mac-cyrillic
> > > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > >
> > > >
> > > > Thanks.
> > > > Vladimir.
> > > >
> > > >
> > > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > > amazing work.
> > > > > > generating the charsets...
> > > > > >
> > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > >
> > > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > > Super cool!!!
> > > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > > >
> > > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > > list of supported charsets you could find in README file.
> > > > > > > >
> > > > > > > >
> > > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > > >
> > > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > > >
> > > > > > > Yes, you right,  we're on the way :)
> > > > > > >
> > > > > > >
> > > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > > >
> > > > > > > > > > > Hi all!
> > > > > > > > > > >
> > > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > > tree.
> > > > > > > > > > >
> > > > > > > > > > > The package could be found there:
> > > > > > > > > > > HARMONY-3593
> > > > > > > > > > >
> > > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > > >
> > > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > > and re-generate all sources.
> > > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > > classes.
> > > > > > > > > > >
> > > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > > find in README file from contributed package.
> > > > > > > > > > >
> > > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > > >
> > > > > > > > > > > Thanks,
> > > > > > > > > > > Vladimir.
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > --
> > > > > > > > > > Best regards,
> > > > > > > > > > Andrew Zhang
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Best regards,
> > > > > > > > Andrew Zhang
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Tony Wu
> > > > > > China Software Development Lab, IBM
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> And I have not found a makefile for native?

Tony,

I've attached makefiles to the JIRA issue.

Thanks.
Vladimir.

> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > Hi Vladimir,
> >
> > Thanks for explanation, I'm testing it on win xp.
> >
> > I encounter a minor problem when try to build native code.
> > ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> > instead of .../src/native/niochar/shared according to readme.
> >
> > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > I wonder if it is possible to make it as built-in charset provider and
> > > > make icu as an extension?
> > >
> > > Attached test bundle, instruction and patch for current code combined
> > > new implementation with ICU. So, I have the same 228 charsets
> > > available - about 90 charsets used from the new bundle, and not
> > > implemented charsets used from ICU.
> > >
> > > The full list of charsets supported with the current bundle + ICU:
> > > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1006_P100-1995
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > > x-ibm-1025_P100-1995
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1097_P100-1995
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > > x-ibm-1098_P100-1995
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > > x-ibm-1112_P100-1995
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > > x-ibm-1122_P100-1999
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > > x-ibm-1123_P100-1995
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > > x-ibm-1124_P100-1996
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1375_P100-2003
> > > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-737_P100-1997
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-875_P100-1995
> > > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > > x-mac-cyrillic
> > > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > >
> > >
> > > Thanks.
> > > Vladimir.
> > >
> > >
> > > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > > amazing work.
> > > > > generating the charsets...
> > > > >
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > >
> > > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > > Super cool!!!
> > > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > > >
> > > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > > list of supported charsets you could find in README file.
> > > > > > >
> > > > > > >
> > > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > > >
> > > > > > > We're on the way to be independent of ICU, right? ;)
> > > > > >
> > > > > > Yes, you right,  we're on the way :)
> > > > > >
> > > > > >
> > > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > > >
> > > > > > > > > > Hi all!
> > > > > > > > > >
> > > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > > tree.
> > > > > > > > > >
> > > > > > > > > > The package could be found there:
> > > > > > > > > > HARMONY-3593
> > > > > > > > > >
> > > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > > >
> > > > > > > > > > The main advantages of the package are the following:
> > > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > > and re-generate all sources.
> > > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > > classes.
> > > > > > > > > >
> > > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > > find in README file from contributed package.
> > > > > > > > > >
> > > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > > >
> > > > > > > > > > Thanks,
> > > > > > > > > > Vladimir.
> > > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Best regards,
> > > > > > > > > Andrew Zhang
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Best regards,
> > > > > > > Andrew Zhang
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Tony Wu
> > > > > China Software Development Lab, IBM
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
And I have not found a makefile for native?

On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> Hi Vladimir,
>
> Thanks for explanation, I'm testing it on win xp.
>
> I encounter a minor problem when try to build native code.
> ....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
> instead of .../src/native/niochar/shared according to readme.
>
> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > I wonder if it is possible to make it as built-in charset provider and
> > > make icu as an extension?
> >
> > Attached test bundle, instruction and patch for current code combined
> > new implementation with ICU. So, I have the same 228 charsets
> > available - about 90 charsets used from the new bundle, and not
> > implemented charsets used from ICU.
> >
> > The full list of charsets supported with the current bundle + ICU:
> > Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> > Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> > Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> > BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> > CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> > cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> > cp851    class=com.ibm.icu4jni.charset.CharsetICU
> > cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> > cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> > cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> > cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> > cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> > cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> > cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> > cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> > cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> > cp866    class=org.apache.harmony.niochar.charset.IBM866
> > cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> > cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> > cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> > EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> > EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> > GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> > GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> > GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> > GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> > hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> > HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> > IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> > IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> > IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> > IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> > IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> > IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> > IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> > IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> > IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> > IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> > IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> > IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> > IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> > IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> > IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> > IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> > IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> > IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> > IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> > IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> > IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> > IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> > IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> > IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> > IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> > IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> > IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> > IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> > IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> > IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> > IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> > IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> > IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> > ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> > ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> > ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> > ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> > ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> > ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> > ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> > ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> > ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> > ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> > ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> > ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> > ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> > ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> > ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> > ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> > JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> > JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> > KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> > KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> > KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> > macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> > SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> > Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> > TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> > US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> > UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> > UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> > UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> > UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> > UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> > UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> > UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> > UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> > windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> > windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> > windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> > windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> > windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> > windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> > windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> > windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> > windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1006_P100-1995
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> > x-ibm-1025_P100-1995
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> > x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1097_P100-1995
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> > x-ibm-1098_P100-1995
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> > x-ibm-1112_P100-1995
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> > x-ibm-1122_P100-1999
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> > x-ibm-1123_P100-1995
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> > x-ibm-1124_P100-1996
> > class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> > x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1375_P100-2003
> > class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> > x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-737_P100-1997
> > class=org.apache.harmony.niochar.charset.additional.x_IBM737
> > x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-875_P100-1995
> > class=org.apache.harmony.niochar.charset.additional.x_IBM875
> > x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> > x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> > x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> > x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> > x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> > x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> > x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> > x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> > x-mac-cyrillic
> > class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> > x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> > x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> > x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> > x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> > x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
> >
> >
> > Thanks.
> > Vladimir.
> >
> >
> > > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > > amazing work.
> > > > generating the charsets...
> > > >
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > >
> > > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > > Super cool!!!
> > > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > > >
> > > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > > list of supported charsets you could find in README file.
> > > > > >
> > > > > >
> > > > > > Hi Vladimir, not unfortunately at all. :)
> > > > > >
> > > > > > We're on the way to be independent of ICU, right? ;)
> > > > >
> > > > > Yes, you right,  we're on the way :)
> > > > >
> > > > >
> > > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > > >
> > > > > > > > > Hi all!
> > > > > > > > >
> > > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > > tree.
> > > > > > > > >
> > > > > > > > > The package could be found there:
> > > > > > > > > HARMONY-3593
> > > > > > > > >
> > > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > > >
> > > > > > > > > The main advantages of the package are the following:
> > > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > > and re-generate all sources.
> > > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > > have such a significant boost for Dacapo.
> > > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > > classes.
> > > > > > > > >
> > > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > > find in README file from contributed package.
> > > > > > > > >
> > > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > > >
> > > > > > > > > Thanks,
> > > > > > > > > Vladimir.
> > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Best regards,
> > > > > > > > Andrew Zhang
> > > > > > > >
> > > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Best regards,
> > > > > > Andrew Zhang
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Tony Wu
> > > > China Software Development Lab, IBM
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
Hi Vladimir,

Thanks for explanation, I'm testing it on win xp.

I encounter a minor problem when try to build native code.
....\trunk\modules\nio_char\src\main\native/niochar/windows was asked
instead of .../src/native/niochar/shared according to readme.

On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > I wonder if it is possible to make it as built-in charset provider and
> > make icu as an extension?
>
> Attached test bundle, instruction and patch for current code combined
> new implementation with ICU. So, I have the same 228 charsets
> available - about 90 charsets used from the new bundle, and not
> implemented charsets used from ICU.
>
> The full list of charsets supported with the current bundle + ICU:
> Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> Big5-HKSCS       class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> cp851    class=com.ibm.icu4jni.charset.CharsetICU
> cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> cp866    class=org.apache.harmony.niochar.charset.IBM866
> cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> IBM-Thai         class=org.apache.harmony.niochar.charset.additional.IBM_Thai
> IBM01140         class=org.apache.harmony.niochar.charset.additional.IBM01140
> IBM01141         class=org.apache.harmony.niochar.charset.additional.IBM01141
> IBM01142         class=org.apache.harmony.niochar.charset.additional.IBM01142
> IBM01143         class=org.apache.harmony.niochar.charset.additional.IBM01143
> IBM01144         class=org.apache.harmony.niochar.charset.additional.IBM01144
> IBM01145         class=org.apache.harmony.niochar.charset.additional.IBM01145
> IBM01146         class=org.apache.harmony.niochar.charset.additional.IBM01146
> IBM01147         class=org.apache.harmony.niochar.charset.additional.IBM01147
> IBM01148         class=org.apache.harmony.niochar.charset.additional.IBM01148
> IBM01149         class=org.apache.harmony.niochar.charset.additional.IBM01149
> IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> ISO-8859-3       class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
> ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> ISO-8859-6       class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
> ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> ISO-8859-8       class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
> ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> KSC_5601         class=org.apache.harmony.niochar.charset.additional.x_windows_949
> macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> Shift_JIS        class=org.apache.harmony.niochar.charset.additional.windows_31j
> TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> windows-1255     class=org.apache.harmony.niochar.charset.additional.windows_1255
> windows-1256     class=org.apache.harmony.niochar.charset.additional.windows_1256
> windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1006_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> x-ibm-1025_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1097_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> x-ibm-1098_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> x-ibm-1112_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> x-ibm-1122_P100-1999
> class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> x-ibm-1123_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> x-ibm-1124_P100-1996
> class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1375_P100-2003
> class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-737_P100-1997
> class=org.apache.harmony.niochar.charset.additional.x_IBM737
> x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-875_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM875
> x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> x-mac-cyrillic
> class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> x-mac-greek      class=org.apache.harmony.niochar.charset.additional.x_MacGreek
> x-mac-turkish    class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
> x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
>
>
> Thanks.
> Vladimir.
>
>
> > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > amazing work.
> > > generating the charsets...
> > >
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > >
> > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > Super cool!!!
> > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > >
> > > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > > list of supported charsets you could find in README file.
> > > > >
> > > > >
> > > > > Hi Vladimir, not unfortunately at all. :)
> > > > >
> > > > > We're on the way to be independent of ICU, right? ;)
> > > >
> > > > Yes, you right,  we're on the way :)
> > > >
> > > >
> > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > >
> > > > > > > > Hi all!
> > > > > > > >
> > > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > > tree.
> > > > > > > >
> > > > > > > > The package could be found there:
> > > > > > > > HARMONY-3593
> > > > > > > >
> > > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > >
> > > > > > > > The main advantages of the package are the following:
> > > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > > some modification would be necessary we need just correct generator
> > > > > > > > and re-generate all sources.
> > > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > > have such a significant boost for Dacapo.
> > > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > > classes.
> > > > > > > >
> > > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > > find in README file from contributed package.
> > > > > > > >
> > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > >
> > > > > > > > Thanks,
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Best regards,
> > > > > > > Andrew Zhang
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Best regards,
> > > > > Andrew Zhang
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Leo Li <li...@gmail.com>.
Hi, Vladimir:
      Maybe my question is a little picky.:)
      I noticed that the data for getCharsetsInfo of our charsetprovider
implementation is hardcoded. So is it possible to provide a more flexible
approach?
      ResourceBundle is an option, but it has some problem in bootstrap.
Once I am trying ICU4J as default charset provider on Harmony, there will be
a bootstrap problem, since when loading required data for charset by
resourcebundle, the ClassLoader.loadResourceBundle still needs
charset. Then an initilization error will thrown.
     I have thought of two possible approach:
     (1) Load the charset information by native code in initialization.
     (2) Using a bootstrap charset, for example, ISO-8859-1, to load data
for other charset.

    I guess, maybe it deserves to discuss a little about the approach(2) .
It will be a great convenience for our customer if they are permitted to
choose charset provider. But some of them are not designed for class
library, that is, they will have bootstrap problem.

On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
>
> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > I wonder if it is possible to make it as built-in charset provider and
> > make icu as an extension?
>
> Attached test bundle, instruction and patch for current code combined
> new implementation with ICU. So, I have the same 228 charsets
> available - about 90 charsets used from the new bundle, and not
> implemented charsets used from ICU.
>
> The full list of charsets supported with the current bundle + ICU:
> Adobe-Standard-Encoding  class=com.ibm.icu4jni.charset.CharsetICU
> Big5     class=org.apache.harmony.niochar.charset.additional.Big5
> Big5-HKSCS       class=
> org.apache.harmony.niochar.charset.additional.Big5_HKSCS
> BOCU-1   class=com.ibm.icu4jni.charset.CharsetICU
> CESU-8   class=com.ibm.icu4jni.charset.CharsetICU
> cp850    class=org.apache.harmony.niochar.charset.additional.IBM850
> cp851    class=com.ibm.icu4jni.charset.CharsetICU
> cp856    class=org.apache.harmony.niochar.charset.additional.x_IBM856
> cp857    class=org.apache.harmony.niochar.charset.additional.IBM857
> cp858    class=org.apache.harmony.niochar.charset.additional.IBM00858
> cp860    class=org.apache.harmony.niochar.charset.additional.IBM860
> cp861    class=org.apache.harmony.niochar.charset.additional.IBM861
> cp862    class=org.apache.harmony.niochar.charset.additional.IBM862
> cp863    class=org.apache.harmony.niochar.charset.additional.IBM863
> cp864    class=org.apache.harmony.niochar.charset.additional.IBM864
> cp865    class=org.apache.harmony.niochar.charset.additional.IBM865
> cp866    class=org.apache.harmony.niochar.charset.IBM866
> cp868    class=org.apache.harmony.niochar.charset.additional.IBM868
> cp869    class=org.apache.harmony.niochar.charset.additional.IBM869
> cp922    class=org.apache.harmony.niochar.charset.additional.x_IBM922
> EUC-JP   class=com.ibm.icu4jni.charset.CharsetICU
> EUC-KR   class=org.apache.harmony.niochar.charset.additional.EUC_KR
> GB18030  class=org.apache.harmony.niochar.charset.additional.GB18030
> GB2312   class=org.apache.harmony.niochar.charset.additional.GB2312
> GB_2312-80       class=com.ibm.icu4jni.charset.CharsetICU
> GBK      class=org.apache.harmony.niochar.charset.additional.GBK
> hp-roman8        class=com.ibm.icu4jni.charset.CharsetICU
> HZ-GB-2312       class=com.ibm.icu4jni.charset.CharsetICU
> IBM-Thai         class=
> org.apache.harmony.niochar.charset.additional.IBM_Thai
> IBM01140         class=
> org.apache.harmony.niochar.charset.additional.IBM01140
> IBM01141         class=
> org.apache.harmony.niochar.charset.additional.IBM01141
> IBM01142         class=
> org.apache.harmony.niochar.charset.additional.IBM01142
> IBM01143         class=
> org.apache.harmony.niochar.charset.additional.IBM01143
> IBM01144         class=
> org.apache.harmony.niochar.charset.additional.IBM01144
> IBM01145         class=
> org.apache.harmony.niochar.charset.additional.IBM01145
> IBM01146         class=
> org.apache.harmony.niochar.charset.additional.IBM01146
> IBM01147         class=
> org.apache.harmony.niochar.charset.additional.IBM01147
> IBM01148         class=
> org.apache.harmony.niochar.charset.additional.IBM01148
> IBM01149         class=
> org.apache.harmony.niochar.charset.additional.IBM01149
> IBM037   class=org.apache.harmony.niochar.charset.additional.IBM037
> IBM1026  class=org.apache.harmony.niochar.charset.additional.IBM1026
> IBM1047  class=org.apache.harmony.niochar.charset.additional.IBM1047
> IBM273   class=org.apache.harmony.niochar.charset.additional.IBM273
> IBM277   class=org.apache.harmony.niochar.charset.additional.IBM277
> IBM278   class=org.apache.harmony.niochar.charset.additional.IBM278
> IBM280   class=org.apache.harmony.niochar.charset.additional.IBM280
> IBM284   class=org.apache.harmony.niochar.charset.additional.IBM284
> IBM285   class=org.apache.harmony.niochar.charset.additional.IBM285
> IBM290   class=com.ibm.icu4jni.charset.CharsetICU
> IBM297   class=org.apache.harmony.niochar.charset.additional.IBM297
> IBM367   class=org.apache.harmony.niochar.charset.US_ASCII
> IBM420   class=org.apache.harmony.niochar.charset.additional.IBM420
> IBM424   class=org.apache.harmony.niochar.charset.additional.IBM424
> IBM437   class=org.apache.harmony.niochar.charset.additional.IBM437
> IBM500   class=org.apache.harmony.niochar.charset.additional.IBM500
> IBM775   class=org.apache.harmony.niochar.charset.additional.IBM775
> IBM852   class=org.apache.harmony.niochar.charset.additional.IBM852
> IBM855   class=org.apache.harmony.niochar.charset.additional.IBM855
> IBM870   class=org.apache.harmony.niochar.charset.additional.IBM870
> IBM871   class=org.apache.harmony.niochar.charset.additional.IBM871
> IBM918   class=org.apache.harmony.niochar.charset.additional.IBM918
> ISO-2022-CN      class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-CN-EXT  class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-JP      class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-JP-2    class=com.ibm.icu4jni.charset.CharsetICU
> ISO-2022-KR      class=com.ibm.icu4jni.charset.CharsetICU
> ISO-8859-1       class=org.apache.harmony.niochar.charset.ISO_8859_1
> ISO-8859-13      class=org.apache.harmony.niochar.charset.ISO_8859_13
> ISO-8859-15      class=org.apache.harmony.niochar.charset.ISO_8859_15
> ISO-8859-2       class=org.apache.harmony.niochar.charset.ISO_8859_2
> ISO-8859-3       class=
> org.apache.harmony.niochar.charset.additional.ISO_8859_3
> ISO-8859-4       class=org.apache.harmony.niochar.charset.ISO_8859_4
> ISO-8859-5       class=org.apache.harmony.niochar.charset.ISO_8859_5
> ISO-8859-6       class=
> org.apache.harmony.niochar.charset.additional.ISO_8859_6
> ISO-8859-7       class=org.apache.harmony.niochar.charset.ISO_8859_7
> ISO-8859-8       class=
> org.apache.harmony.niochar.charset.additional.ISO_8859_8
> ISO-8859-9       class=org.apache.harmony.niochar.charset.ISO_8859_9
> JIS_Encoding     class=com.ibm.icu4jni.charset.CharsetICU
> JIS_X0201        class=com.ibm.icu4jni.charset.CharsetICU
> KOI8-R   class=org.apache.harmony.niochar.charset.KOI8_R
> KOI8-U   class=com.ibm.icu4jni.charset.CharsetICU
> KSC_5601         class=
> org.apache.harmony.niochar.charset.additional.x_windows_949
> macintosh        class=com.ibm.icu4jni.charset.CharsetICU
> SCSU     class=com.ibm.icu4jni.charset.CharsetICU
> Shift_JIS        class=
> org.apache.harmony.niochar.charset.additional.windows_31j
> TIS-620  class=org.apache.harmony.niochar.charset.additional.x_IBM874
> US-ASCII         class=org.apache.harmony.niochar.charset.US_ASCII
> UTF-16   class=org.apache.harmony.niochar.charset.UTF_16
> UTF-16BE         class=org.apache.harmony.niochar.charset.UTF_16BE
> UTF-16LE         class=org.apache.harmony.niochar.charset.UTF_16LE
> UTF-32   class=com.ibm.icu4jni.charset.CharsetICU
> UTF-32BE         class=com.ibm.icu4jni.charset.CharsetICU
> UTF-32LE         class=com.ibm.icu4jni.charset.CharsetICU
> UTF-7    class=com.ibm.icu4jni.charset.CharsetICU
> UTF-8    class=org.apache.harmony.niochar.charset.UTF_8
> windows-1250     class=org.apache.harmony.niochar.charset.CP_1250
> windows-1251     class=org.apache.harmony.niochar.charset.CP_1251
> windows-1252     class=org.apache.harmony.niochar.charset.CP_1252
> windows-1253     class=org.apache.harmony.niochar.charset.CP_1253
> windows-1254     class=org.apache.harmony.niochar.charset.CP_1254
> windows-1255     class=
> org.apache.harmony.niochar.charset.additional.windows_1255
> windows-1256     class=
> org.apache.harmony.niochar.charset.additional.windows_1256
> windows-1257     class=org.apache.harmony.niochar.charset.CP_1257
> windows-1258     class=com.ibm.icu4jni.charset.CharsetICU
> x-ebcdic-xml-us  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1006_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1006
> x-ibm-1025_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1025
> x-ibm-1047-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1097_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1097
> x-ibm-1098_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1098
> x-ibm-1112_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1112
> x-ibm-1122_P100-1999
> class=org.apache.harmony.niochar.charset.additional.x_IBM1122
> x-ibm-1123_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM1123
> x-ibm-1124_P100-1996
> class=org.apache.harmony.niochar.charset.additional.x_IBM1124
> x-ibm-1125_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1129_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1130_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1131_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1132_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1133_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1137_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1140-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1142-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1143-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1144-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1145-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1146-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1147-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1148-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1149-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1153-s390  class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1153_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1154_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1155_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1156_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1157_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1158_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1160_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1162_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1164_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1250_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1251_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1252_P100-2000     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1253_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1254_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1255_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1256_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1257_P100-1995     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1258_P100-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-12712-s390         class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-12712_P100-1998    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1363_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1364_P110-1997     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1371_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1373_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1375_P100-2003
> class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
> x-ibm-1386_P100-2002     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1388_P103-2001     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1390_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-1399_P110-2003     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-16684_P110-2003    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-16804-s390         class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-16804_X110-1999    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-25546      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-33722_P120-1999    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-37-s390    class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-4899_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-4909_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-4971_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5123_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5351_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5352_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-5353_P100-1998     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-737_P100-1997
> class=org.apache.harmony.niochar.charset.additional.x_IBM737
> x-ibm-803_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-813_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-8482_P100-1999     class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-867_P100-1998      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-875_P100-1995
> class=org.apache.harmony.niochar.charset.additional.x_IBM875
> x-ibm-901_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-902_P100-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-930_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-933_P110-1995      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-935_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-937_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-939_P120-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-942_P12A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-943_P130-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-949_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-949_P11A-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-950_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-954_P101-2000      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-964_P110-1999      class=com.ibm.icu4jni.charset.CharsetICU
> x-ibm-971_P100-1995      class=com.ibm.icu4jni.charset.CharsetICU
> x-IMAP-mailbox-name      class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-be       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-de       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-gu       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-ka       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-ma       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-or       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-pa       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-ta       class=com.ibm.icu4jni.charset.CharsetICU
> x-iscii-te       class=com.ibm.icu4jni.charset.CharsetICU
> x-JIS7   class=com.ibm.icu4jni.charset.CharsetICU
> x-JIS8   class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-1        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-11       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-16       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-17       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-18       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-19       class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-2        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-3        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-4        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-5        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-6        class=com.ibm.icu4jni.charset.CharsetICU
> x-LMBCS-8        class=com.ibm.icu4jni.charset.CharsetICU
> x-mac-centraleurroman    class=com.ibm.icu4jni.charset.CharsetICU
> x-mac-cyrillic
> class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
> x-mac-greek      class=
> org.apache.harmony.niochar.charset.additional.x_MacGreek
> x-mac-turkish    class=
> org.apache.harmony.niochar.charset.additional.x_MacTurkish
> x-UTF16_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-UTF16_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-UTF32_OppositeEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-UTF32_PlatformEndian   class=com.ibm.icu4jni.charset.CharsetICU
> x-windows-874-2000       class=com.ibm.icu4jni.charset.CharsetICU
> x-windows-949-2000       class=com.ibm.icu4jni.charset.CharsetICU
>
>
> Thanks.
> Vladimir.
>
>
> > On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > > amazing work.
> > > generating the charsets...
> > >
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > >
> > > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > > Super cool!!!
> > > > > > > Does it mean we're not dependent on ICU any more?
> > > > > >
> > > > > > Unfortunately not all charsets supported with attached bundle.
> The
> > > > > > list of supported charsets you could find in README file.
> > > > >
> > > > >
> > > > > Hi Vladimir, not unfortunately at all. :)
> > > > >
> > > > > We're on the way to be independent of ICU, right? ;)
> > > >
> > > > Yes, you right,  we're on the way :)
> > > >
> > > >
> > > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > > >
> > > > > > > > Hi all!
> > > > > > > >
> > > > > > > > I'm happy to announce one more contribution to harmony on
> behalf of
> > > > > > > > Intel. Provided implementation of charset encoders/decoders
> is
> > > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > > operations. The code was developed in clean-room environment
> inside
> > > > > > > > Intel and I'd like you to play with it and include to
> current Harmony
> > > > > > > > tree.
> > > > > > > >
> > > > > > > > The package could be found there:
> > > > > > > > HARMONY-3593
> > > > > > > >
> > > > > > > > The algorithms for charsets encoding/decoding differs from
> that of
> > > > > > > > ICU, all charsets are generated from current Harmony or any
> other
> > > > > > > > implementation of Java and could be properly integrated into
> current
> > > > > > > > nio_char module. The archive contains source files for 6
> charsets:
> > > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE,
> UTF-16LE;
> > > > > > > > implementation of CharsetProvider; generator for other
> Charsets and
> > > > > > > > native part. I've tested the package with more that 90
> charsets, and
> > > > > > > > all benchmarks and tests passed with new bundle.
> Additionally I have
> > > > > > > > significant boost for Dacapo.antlr and Dacapo.xalanbenchmarks with
> > > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have
> 2.5x boost
> > > > > > > > for antlr and ~5-8x for xalan.
> > > > > > > >
> > > > > > > > The main advantages of the package are the following:
> > > > > > > >   - Code for every charset is generated by CharsetGenerator,
> thus, if
> > > > > > > > some modification would be necessary we need just correct
> generator
> > > > > > > > and re-generate all sources.
> > > > > > > >   - We use 2 different encoders and decoders for java and
> direct
> > > > > > > > buffers. Since most applications use java heap buffers,
> unlike
> > > > > > > > existing implementation it doesn't produce lots of native
> calls to
> > > > > > > > perform encoding/decoding operations on the java buffers
> those
> > > > > > > > significantly improving performance. This is the main reason
> why we
> > > > > > > > have such a significant boost for Dacapo.
> > > > > > > >   - Charset tables for encoding/decoding are stored in
> appropriate
> > > > > > > > classes.
> > > > > > > >
> > > > > > > > Since the package contains implementation for 6 charsets
> only,
> > > > > > > > documentations how to generate and build additional charsets
> you could
> > > > > > > > find in README file from contributed package.
> > > > > > > >
> > > > > > > > Please do not hesitate to contact me for more details.
> > > > > > > >
> > > > > > > > Thanks,
> > > > > > > > Vladimir.
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Best regards,
> > > > > > > Andrew Zhang
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Best regards,
> > > > > Andrew Zhang
> > > > >
> > > >
> > >
> > >
> > > --
> > > Tony Wu
> > > China Software Development Lab, IBM
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>



-- 
Leo Li
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> I wonder if it is possible to make it as built-in charset provider and
> make icu as an extension?

Attached test bundle, instruction and patch for current code combined
new implementation with ICU. So, I have the same 228 charsets
available - about 90 charsets used from the new bundle, and not
implemented charsets used from ICU.

The full list of charsets supported with the current bundle + ICU:
Adobe-Standard-Encoding	 class=com.ibm.icu4jni.charset.CharsetICU
Big5	 class=org.apache.harmony.niochar.charset.additional.Big5
Big5-HKSCS	 class=org.apache.harmony.niochar.charset.additional.Big5_HKSCS
BOCU-1	 class=com.ibm.icu4jni.charset.CharsetICU
CESU-8	 class=com.ibm.icu4jni.charset.CharsetICU
cp850	 class=org.apache.harmony.niochar.charset.additional.IBM850
cp851	 class=com.ibm.icu4jni.charset.CharsetICU
cp856	 class=org.apache.harmony.niochar.charset.additional.x_IBM856
cp857	 class=org.apache.harmony.niochar.charset.additional.IBM857
cp858	 class=org.apache.harmony.niochar.charset.additional.IBM00858
cp860	 class=org.apache.harmony.niochar.charset.additional.IBM860
cp861	 class=org.apache.harmony.niochar.charset.additional.IBM861
cp862	 class=org.apache.harmony.niochar.charset.additional.IBM862
cp863	 class=org.apache.harmony.niochar.charset.additional.IBM863
cp864	 class=org.apache.harmony.niochar.charset.additional.IBM864
cp865	 class=org.apache.harmony.niochar.charset.additional.IBM865
cp866	 class=org.apache.harmony.niochar.charset.IBM866
cp868	 class=org.apache.harmony.niochar.charset.additional.IBM868
cp869	 class=org.apache.harmony.niochar.charset.additional.IBM869
cp922	 class=org.apache.harmony.niochar.charset.additional.x_IBM922
EUC-JP	 class=com.ibm.icu4jni.charset.CharsetICU
EUC-KR	 class=org.apache.harmony.niochar.charset.additional.EUC_KR
GB18030	 class=org.apache.harmony.niochar.charset.additional.GB18030
GB2312	 class=org.apache.harmony.niochar.charset.additional.GB2312
GB_2312-80	 class=com.ibm.icu4jni.charset.CharsetICU
GBK	 class=org.apache.harmony.niochar.charset.additional.GBK
hp-roman8	 class=com.ibm.icu4jni.charset.CharsetICU
HZ-GB-2312	 class=com.ibm.icu4jni.charset.CharsetICU
IBM-Thai	 class=org.apache.harmony.niochar.charset.additional.IBM_Thai
IBM01140	 class=org.apache.harmony.niochar.charset.additional.IBM01140
IBM01141	 class=org.apache.harmony.niochar.charset.additional.IBM01141
IBM01142	 class=org.apache.harmony.niochar.charset.additional.IBM01142
IBM01143	 class=org.apache.harmony.niochar.charset.additional.IBM01143
IBM01144	 class=org.apache.harmony.niochar.charset.additional.IBM01144
IBM01145	 class=org.apache.harmony.niochar.charset.additional.IBM01145
IBM01146	 class=org.apache.harmony.niochar.charset.additional.IBM01146
IBM01147	 class=org.apache.harmony.niochar.charset.additional.IBM01147
IBM01148	 class=org.apache.harmony.niochar.charset.additional.IBM01148
IBM01149	 class=org.apache.harmony.niochar.charset.additional.IBM01149
IBM037	 class=org.apache.harmony.niochar.charset.additional.IBM037
IBM1026	 class=org.apache.harmony.niochar.charset.additional.IBM1026
IBM1047	 class=org.apache.harmony.niochar.charset.additional.IBM1047
IBM273	 class=org.apache.harmony.niochar.charset.additional.IBM273
IBM277	 class=org.apache.harmony.niochar.charset.additional.IBM277
IBM278	 class=org.apache.harmony.niochar.charset.additional.IBM278
IBM280	 class=org.apache.harmony.niochar.charset.additional.IBM280
IBM284	 class=org.apache.harmony.niochar.charset.additional.IBM284
IBM285	 class=org.apache.harmony.niochar.charset.additional.IBM285
IBM290	 class=com.ibm.icu4jni.charset.CharsetICU
IBM297	 class=org.apache.harmony.niochar.charset.additional.IBM297
IBM367	 class=org.apache.harmony.niochar.charset.US_ASCII
IBM420	 class=org.apache.harmony.niochar.charset.additional.IBM420
IBM424	 class=org.apache.harmony.niochar.charset.additional.IBM424
IBM437	 class=org.apache.harmony.niochar.charset.additional.IBM437
IBM500	 class=org.apache.harmony.niochar.charset.additional.IBM500
IBM775	 class=org.apache.harmony.niochar.charset.additional.IBM775
IBM852	 class=org.apache.harmony.niochar.charset.additional.IBM852
IBM855	 class=org.apache.harmony.niochar.charset.additional.IBM855
IBM870	 class=org.apache.harmony.niochar.charset.additional.IBM870
IBM871	 class=org.apache.harmony.niochar.charset.additional.IBM871
IBM918	 class=org.apache.harmony.niochar.charset.additional.IBM918
ISO-2022-CN	 class=com.ibm.icu4jni.charset.CharsetICU
ISO-2022-CN-EXT	 class=com.ibm.icu4jni.charset.CharsetICU
ISO-2022-JP	 class=com.ibm.icu4jni.charset.CharsetICU
ISO-2022-JP-2	 class=com.ibm.icu4jni.charset.CharsetICU
ISO-2022-KR	 class=com.ibm.icu4jni.charset.CharsetICU
ISO-8859-1	 class=org.apache.harmony.niochar.charset.ISO_8859_1
ISO-8859-13	 class=org.apache.harmony.niochar.charset.ISO_8859_13
ISO-8859-15	 class=org.apache.harmony.niochar.charset.ISO_8859_15
ISO-8859-2	 class=org.apache.harmony.niochar.charset.ISO_8859_2
ISO-8859-3	 class=org.apache.harmony.niochar.charset.additional.ISO_8859_3
ISO-8859-4	 class=org.apache.harmony.niochar.charset.ISO_8859_4
ISO-8859-5	 class=org.apache.harmony.niochar.charset.ISO_8859_5
ISO-8859-6	 class=org.apache.harmony.niochar.charset.additional.ISO_8859_6
ISO-8859-7	 class=org.apache.harmony.niochar.charset.ISO_8859_7
ISO-8859-8	 class=org.apache.harmony.niochar.charset.additional.ISO_8859_8
ISO-8859-9	 class=org.apache.harmony.niochar.charset.ISO_8859_9
JIS_Encoding	 class=com.ibm.icu4jni.charset.CharsetICU
JIS_X0201	 class=com.ibm.icu4jni.charset.CharsetICU
KOI8-R	 class=org.apache.harmony.niochar.charset.KOI8_R
KOI8-U	 class=com.ibm.icu4jni.charset.CharsetICU
KSC_5601	 class=org.apache.harmony.niochar.charset.additional.x_windows_949
macintosh	 class=com.ibm.icu4jni.charset.CharsetICU
SCSU	 class=com.ibm.icu4jni.charset.CharsetICU
Shift_JIS	 class=org.apache.harmony.niochar.charset.additional.windows_31j
TIS-620	 class=org.apache.harmony.niochar.charset.additional.x_IBM874
US-ASCII	 class=org.apache.harmony.niochar.charset.US_ASCII
UTF-16	 class=org.apache.harmony.niochar.charset.UTF_16
UTF-16BE	 class=org.apache.harmony.niochar.charset.UTF_16BE
UTF-16LE	 class=org.apache.harmony.niochar.charset.UTF_16LE
UTF-32	 class=com.ibm.icu4jni.charset.CharsetICU
UTF-32BE	 class=com.ibm.icu4jni.charset.CharsetICU
UTF-32LE	 class=com.ibm.icu4jni.charset.CharsetICU
UTF-7	 class=com.ibm.icu4jni.charset.CharsetICU
UTF-8	 class=org.apache.harmony.niochar.charset.UTF_8
windows-1250	 class=org.apache.harmony.niochar.charset.CP_1250
windows-1251	 class=org.apache.harmony.niochar.charset.CP_1251
windows-1252	 class=org.apache.harmony.niochar.charset.CP_1252
windows-1253	 class=org.apache.harmony.niochar.charset.CP_1253
windows-1254	 class=org.apache.harmony.niochar.charset.CP_1254
windows-1255	 class=org.apache.harmony.niochar.charset.additional.windows_1255
windows-1256	 class=org.apache.harmony.niochar.charset.additional.windows_1256
windows-1257	 class=org.apache.harmony.niochar.charset.CP_1257
windows-1258	 class=com.ibm.icu4jni.charset.CharsetICU
x-ebcdic-xml-us	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1006_P100-1995	
class=org.apache.harmony.niochar.charset.additional.x_IBM1006
x-ibm-1025_P100-1995	
class=org.apache.harmony.niochar.charset.additional.x_IBM1025
x-ibm-1047-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1097_P100-1995	
class=org.apache.harmony.niochar.charset.additional.x_IBM1097
x-ibm-1098_P100-1995	
class=org.apache.harmony.niochar.charset.additional.x_IBM1098
x-ibm-1112_P100-1995	
class=org.apache.harmony.niochar.charset.additional.x_IBM1112
x-ibm-1122_P100-1999	
class=org.apache.harmony.niochar.charset.additional.x_IBM1122
x-ibm-1123_P100-1995	
class=org.apache.harmony.niochar.charset.additional.x_IBM1123
x-ibm-1124_P100-1996	
class=org.apache.harmony.niochar.charset.additional.x_IBM1124
x-ibm-1125_P100-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1129_P100-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1130_P100-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1131_P100-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1132_P100-1998	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1133_P100-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1137_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1140-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1142-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1143-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1144-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1145-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1146-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1147-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1148-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1149-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1153-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1153_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1154_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1155_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1156_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1157_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1158_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1160_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1162_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1164_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1250_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1251_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1252_P100-2000	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1253_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1254_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1255_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1256_P110-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1257_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1258_P100-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-12712-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-12712_P100-1998	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1363_P110-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1364_P110-1997	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1371_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1373_P100-2002	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1375_P100-2003	
class=org.apache.harmony.niochar.charset.additional.x_MS950_HKSCS
x-ibm-1386_P100-2002	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1388_P103-2001	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1390_P110-2003	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-1399_P110-2003	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-16684_P110-2003	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-16804-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-16804_X110-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-25546	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-33722_P120-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-37-s390	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-4899_P100-1998	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-4909_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-4971_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-5123_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-5351_P100-1998	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-5352_P100-1998	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-5353_P100-1998	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-737_P100-1997	
class=org.apache.harmony.niochar.charset.additional.x_IBM737
x-ibm-803_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-813_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-8482_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-867_P100-1998	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-875_P100-1995	
class=org.apache.harmony.niochar.charset.additional.x_IBM875
x-ibm-901_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-902_P100-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-930_P120-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-933_P110-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-935_P110-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-937_P110-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-939_P120-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-942_P12A-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-943_P130-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-949_P110-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-949_P11A-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-950_P110-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-954_P101-2000	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-964_P110-1999	 class=com.ibm.icu4jni.charset.CharsetICU
x-ibm-971_P100-1995	 class=com.ibm.icu4jni.charset.CharsetICU
x-IMAP-mailbox-name	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-be	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-de	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-gu	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-ka	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-ma	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-or	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-pa	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-ta	 class=com.ibm.icu4jni.charset.CharsetICU
x-iscii-te	 class=com.ibm.icu4jni.charset.CharsetICU
x-JIS7	 class=com.ibm.icu4jni.charset.CharsetICU
x-JIS8	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-1	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-11	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-16	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-17	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-18	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-19	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-2	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-3	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-4	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-5	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-6	 class=com.ibm.icu4jni.charset.CharsetICU
x-LMBCS-8	 class=com.ibm.icu4jni.charset.CharsetICU
x-mac-centraleurroman	 class=com.ibm.icu4jni.charset.CharsetICU
x-mac-cyrillic	
class=org.apache.harmony.niochar.charset.additional.x_MacCyrillic
x-mac-greek	 class=org.apache.harmony.niochar.charset.additional.x_MacGreek
x-mac-turkish	 class=org.apache.harmony.niochar.charset.additional.x_MacTurkish
x-UTF16_OppositeEndian	 class=com.ibm.icu4jni.charset.CharsetICU
x-UTF16_PlatformEndian	 class=com.ibm.icu4jni.charset.CharsetICU
x-UTF32_OppositeEndian	 class=com.ibm.icu4jni.charset.CharsetICU
x-UTF32_PlatformEndian	 class=com.ibm.icu4jni.charset.CharsetICU
x-windows-874-2000	 class=com.ibm.icu4jni.charset.CharsetICU
x-windows-949-2000	 class=com.ibm.icu4jni.charset.CharsetICU


Thanks.
Vladimir.


> On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> > amazing work.
> > generating the charsets...
> >
> > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > >
> > > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > > Super cool!!!
> > > > > > Does it mean we're not dependent on ICU any more?
> > > > >
> > > > > Unfortunately not all charsets supported with attached bundle. The
> > > > > list of supported charsets you could find in README file.
> > > >
> > > >
> > > > Hi Vladimir, not unfortunately at all. :)
> > > >
> > > > We're on the way to be independent of ICU, right? ;)
> > >
> > > Yes, you right,  we're on the way :)
> > >
> > >
> > > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > > >
> > > > > > > Hi all!
> > > > > > >
> > > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > > operations. The code was developed in clean-room environment inside
> > > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > > tree.
> > > > > > >
> > > > > > > The package could be found there:
> > > > > > > HARMONY-3593
> > > > > > >
> > > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > > implementation of Java and could be properly integrated into current
> > > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > > for antlr and ~5-8x for xalan.
> > > > > > >
> > > > > > > The main advantages of the package are the following:
> > > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > > some modification would be necessary we need just correct generator
> > > > > > > and re-generate all sources.
> > > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > > significantly improving performance. This is the main reason why we
> > > > > > > have such a significant boost for Dacapo.
> > > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > > classes.
> > > > > > >
> > > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > > documentations how to generate and build additional charsets you could
> > > > > > > find in README file from contributed package.
> > > > > > >
> > > > > > > Please do not hesitate to contact me for more details.
> > > > > > >
> > > > > > > Thanks,
> > > > > > > Vladimir.
> > > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Best regards,
> > > > > > Andrew Zhang
> > > > > >
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > Best regards,
> > > > Andrew Zhang
> > > >
> > >
> >
> >
> > --
> > Tony Wu
> > China Software Development Lab, IBM
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>

RE: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by "Volynets, Vera" <ve...@intel.com>.
Hi,
there is one small point about our independence from icu.
Vm uses icu4c during classfile parsing. It would be great if we have the
same functionality and don't use icu4c.
Do you work on it?

WBR,Vera!

-----Original Message-----
From: Tony Wu [mailto:wuyuehao@gmail.com] 
Sent: Monday, April 09, 2007 1:50 PM
To: dev@harmony.apache.org
Subject: Re: [contribution] Contribution of charset encoders/decoders
for NIO_CHAR module

I wonder if it is possible to make it as built-in charset provider and
make icu as an extension?

On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> amazing work.
> generating the charsets...
>
> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > >
> > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > Super cool!!!
> > > > > Does it mean we're not dependent on ICU any more?
> > > >
> > > > Unfortunately not all charsets supported with attached bundle.
The
> > > > list of supported charsets you could find in README file.
> > >
> > >
> > > Hi Vladimir, not unfortunately at all. :)
> > >
> > > We're on the way to be independent of ICU, right? ;)
> >
> > Yes, you right,  we're on the way :)
> >
> >
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > >
> > > > > > Hi all!
> > > > > >
> > > > > > I'm happy to announce one more contribution to harmony on
behalf of
> > > > > > Intel. Provided implementation of charset encoders/decoders
is
> > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > operations. The code was developed in clean-room environment
inside
> > > > > > Intel and I'd like you to play with it and include to
current Harmony
> > > > > > tree.
> > > > > >
> > > > > > The package could be found there:
> > > > > > HARMONY-3593
> > > > > >
> > > > > > The algorithms for charsets encoding/decoding differs from
that of
> > > > > > ICU, all charsets are generated from current Harmony or any
other
> > > > > > implementation of Java and could be properly integrated into
current
> > > > > > nio_char module. The archive contains source files for 6
charsets:
> > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE,
UTF-16LE;
> > > > > > implementation of CharsetProvider; generator for other
Charsets and
> > > > > > native part. I've tested the package with more that 90
charsets, and
> > > > > > all benchmarks and tests passed with new bundle.
Additionally I have
> > > > > > significant boost for Dacapo.antlr and Dacapo.xalan
benchmarks with
> > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have
2.5x boost
> > > > > > for antlr and ~5-8x for xalan.
> > > > > >
> > > > > > The main advantages of the package are the following:
> > > > > >   - Code for every charset is generated by CharsetGenerator,
thus, if
> > > > > > some modification would be necessary we need just correct
generator
> > > > > > and re-generate all sources.
> > > > > >   - We use 2 different encoders and decoders for java and
direct
> > > > > > buffers. Since most applications use java heap buffers,
unlike
> > > > > > existing implementation it doesn't produce lots of native
calls to
> > > > > > perform encoding/decoding operations on the java buffers
those
> > > > > > significantly improving performance. This is the main reason
why we
> > > > > > have such a significant boost for Dacapo.
> > > > > >   - Charset tables for encoding/decoding are stored in
appropriate
> > > > > > classes.
> > > > > >
> > > > > > Since the package contains implementation for 6 charsets
only,
> > > > > > documentations how to generate and build additional charsets
you could
> > > > > > find in README file from contributed package.
> > > > > >
> > > > > > Please do not hesitate to contact me for more details.
> > > > > >
> > > > > > Thanks,
> > > > > > Vladimir.
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Best regards,
> > > > > Andrew Zhang
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > Best regards,
> > > Andrew Zhang
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
I wonder if it is possible to make it as built-in charset provider and
make icu as an extension?

On 4/9/07, Tony Wu <wu...@gmail.com> wrote:
> amazing work.
> generating the charsets...
>
> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > >
> > > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > > Super cool!!!
> > > > > Does it mean we're not dependent on ICU any more?
> > > >
> > > > Unfortunately not all charsets supported with attached bundle. The
> > > > list of supported charsets you could find in README file.
> > >
> > >
> > > Hi Vladimir, not unfortunately at all. :)
> > >
> > > We're on the way to be independent of ICU, right? ;)
> >
> > Yes, you right,  we're on the way :)
> >
> >
> > > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > > >
> > > > > > Hi all!
> > > > > >
> > > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > > operations. The code was developed in clean-room environment inside
> > > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > > tree.
> > > > > >
> > > > > > The package could be found there:
> > > > > > HARMONY-3593
> > > > > >
> > > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > > implementation of Java and could be properly integrated into current
> > > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > > for antlr and ~5-8x for xalan.
> > > > > >
> > > > > > The main advantages of the package are the following:
> > > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > > some modification would be necessary we need just correct generator
> > > > > > and re-generate all sources.
> > > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > > existing implementation it doesn't produce lots of native calls to
> > > > > > perform encoding/decoding operations on the java buffers those
> > > > > > significantly improving performance. This is the main reason why we
> > > > > > have such a significant boost for Dacapo.
> > > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > > classes.
> > > > > >
> > > > > > Since the package contains implementation for 6 charsets only,
> > > > > > documentations how to generate and build additional charsets you could
> > > > > > find in README file from contributed package.
> > > > > >
> > > > > > Please do not hesitate to contact me for more details.
> > > > > >
> > > > > > Thanks,
> > > > > > Vladimir.
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Best regards,
> > > > > Andrew Zhang
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > Best regards,
> > > Andrew Zhang
> > >
> >
>
>
> --
> Tony Wu
> China Software Development Lab, IBM
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Tony Wu <wu...@gmail.com>.
amazing work.
generating the charsets...

On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > >
> > > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > > Super cool!!!
> > > > Does it mean we're not dependent on ICU any more?
> > >
> > > Unfortunately not all charsets supported with attached bundle. The
> > > list of supported charsets you could find in README file.
> >
> >
> > Hi Vladimir, not unfortunately at all. :)
> >
> > We're on the way to be independent of ICU, right? ;)
>
> Yes, you right,  we're on the way :)
>
>
> > > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > > >
> > > > > Hi all!
> > > > >
> > > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > > Intel. Provided implementation of charset encoders/decoders is
> > > > > intended to replace the ICU-based charsets encoding/decoding
> > > > > operations. The code was developed in clean-room environment inside
> > > > > Intel and I'd like you to play with it and include to current Harmony
> > > > > tree.
> > > > >
> > > > > The package could be found there:
> > > > > HARMONY-3593
> > > > >
> > > > > The algorithms for charsets encoding/decoding differs from that of
> > > > > ICU, all charsets are generated from current Harmony or any other
> > > > > implementation of Java and could be properly integrated into current
> > > > > nio_char module. The archive contains source files for 6 charsets:
> > > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > > implementation of CharsetProvider; generator for other Charsets and
> > > > > native part. I've tested the package with more that 90 charsets, and
> > > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > > for antlr and ~5-8x for xalan.
> > > > >
> > > > > The main advantages of the package are the following:
> > > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > > some modification would be necessary we need just correct generator
> > > > > and re-generate all sources.
> > > > >   - We use 2 different encoders and decoders for java and direct
> > > > > buffers. Since most applications use java heap buffers, unlike
> > > > > existing implementation it doesn't produce lots of native calls to
> > > > > perform encoding/decoding operations on the java buffers those
> > > > > significantly improving performance. This is the main reason why we
> > > > > have such a significant boost for Dacapo.
> > > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > > classes.
> > > > >
> > > > > Since the package contains implementation for 6 charsets only,
> > > > > documentations how to generate and build additional charsets you could
> > > > > find in README file from contributed package.
> > > > >
> > > > > Please do not hesitate to contact me for more details.
> > > > >
> > > > > Thanks,
> > > > > Vladimir.
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > Best regards,
> > > > Andrew Zhang
> > > >
> > >
> >
> >
> >
> > --
> > Best regards,
> > Andrew Zhang
> >
>


-- 
Tony Wu
China Software Development Lab, IBM

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> >
> > On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > > Super cool!!!
> > > Does it mean we're not dependent on ICU any more?
> >
> > Unfortunately not all charsets supported with attached bundle. The
> > list of supported charsets you could find in README file.
>
>
> Hi Vladimir, not unfortunately at all. :)
>
> We're on the way to be independent of ICU, right? ;)

Yes, you right,  we're on the way :)


> > On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > > >
> > > > Hi all!
> > > >
> > > > I'm happy to announce one more contribution to harmony on behalf of
> > > > Intel. Provided implementation of charset encoders/decoders is
> > > > intended to replace the ICU-based charsets encoding/decoding
> > > > operations. The code was developed in clean-room environment inside
> > > > Intel and I'd like you to play with it and include to current Harmony
> > > > tree.
> > > >
> > > > The package could be found there:
> > > > HARMONY-3593
> > > >
> > > > The algorithms for charsets encoding/decoding differs from that of
> > > > ICU, all charsets are generated from current Harmony or any other
> > > > implementation of Java and could be properly integrated into current
> > > > nio_char module. The archive contains source files for 6 charsets:
> > > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > > implementation of CharsetProvider; generator for other Charsets and
> > > > native part. I've tested the package with more that 90 charsets, and
> > > > all benchmarks and tests passed with new bundle. Additionally I have
> > > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > > for antlr and ~5-8x for xalan.
> > > >
> > > > The main advantages of the package are the following:
> > > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > > some modification would be necessary we need just correct generator
> > > > and re-generate all sources.
> > > >   - We use 2 different encoders and decoders for java and direct
> > > > buffers. Since most applications use java heap buffers, unlike
> > > > existing implementation it doesn't produce lots of native calls to
> > > > perform encoding/decoding operations on the java buffers those
> > > > significantly improving performance. This is the main reason why we
> > > > have such a significant boost for Dacapo.
> > > >   - Charset tables for encoding/decoding are stored in appropriate
> > > > classes.
> > > >
> > > > Since the package contains implementation for 6 charsets only,
> > > > documentations how to generate and build additional charsets you could
> > > > find in README file from contributed package.
> > > >
> > > > Please do not hesitate to contact me for more details.
> > > >
> > > > Thanks,
> > > > Vladimir.
> > > >
> > >
> > >
> > >
> > > --
> > > Best regards,
> > > Andrew Zhang
> > >
> >
>
>
>
> --
> Best regards,
> Andrew Zhang
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Andrew Zhang <zh...@gmail.com>.
On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
>
> On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> > Super cool!!!
> > Does it mean we're not dependent on ICU any more?
>
> Unfortunately not all charsets supported with attached bundle. The
> list of supported charsets you could find in README file.


Hi Vladimir, not unfortunately at all. :)

We're on the way to be independent of ICU, right? ;)

> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> > >
> > > Hi all!
> > >
> > > I'm happy to announce one more contribution to harmony on behalf of
> > > Intel. Provided implementation of charset encoders/decoders is
> > > intended to replace the ICU-based charsets encoding/decoding
> > > operations. The code was developed in clean-room environment inside
> > > Intel and I'd like you to play with it and include to current Harmony
> > > tree.
> > >
> > > The package could be found there:
> > > HARMONY-3593
> > >
> > > The algorithms for charsets encoding/decoding differs from that of
> > > ICU, all charsets are generated from current Harmony or any other
> > > implementation of Java and could be properly integrated into current
> > > nio_char module. The archive contains source files for 6 charsets:
> > > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > > implementation of CharsetProvider; generator for other Charsets and
> > > native part. I've tested the package with more that 90 charsets, and
> > > all benchmarks and tests passed with new bundle. Additionally I have
> > > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > > for antlr and ~5-8x for xalan.
> > >
> > > The main advantages of the package are the following:
> > >   - Code for every charset is generated by CharsetGenerator, thus, if
> > > some modification would be necessary we need just correct generator
> > > and re-generate all sources.
> > >   - We use 2 different encoders and decoders for java and direct
> > > buffers. Since most applications use java heap buffers, unlike
> > > existing implementation it doesn't produce lots of native calls to
> > > perform encoding/decoding operations on the java buffers those
> > > significantly improving performance. This is the main reason why we
> > > have such a significant boost for Dacapo.
> > >   - Charset tables for encoding/decoding are stored in appropriate
> > > classes.
> > >
> > > Since the package contains implementation for 6 charsets only,
> > > documentations how to generate and build additional charsets you could
> > > find in README file from contributed package.
> > >
> > > Please do not hesitate to contact me for more details.
> > >
> > > Thanks,
> > > Vladimir.
> > >
> >
> >
> >
> > --
> > Best regards,
> > Andrew Zhang
> >
>



-- 
Best regards,
Andrew Zhang

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Vladimir Strigun <vs...@gmail.com>.
On 4/9/07, Andrew Zhang <zh...@gmail.com> wrote:
> Super cool!!!
> Does it mean we're not dependent on ICU any more?

Unfortunately not all charsets supported with attached bundle. The
list of supported charsets you could find in README file.

> On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
> >
> > Hi all!
> >
> > I'm happy to announce one more contribution to harmony on behalf of
> > Intel. Provided implementation of charset encoders/decoders is
> > intended to replace the ICU-based charsets encoding/decoding
> > operations. The code was developed in clean-room environment inside
> > Intel and I'd like you to play with it and include to current Harmony
> > tree.
> >
> > The package could be found there:
> > HARMONY-3593
> >
> > The algorithms for charsets encoding/decoding differs from that of
> > ICU, all charsets are generated from current Harmony or any other
> > implementation of Java and could be properly integrated into current
> > nio_char module. The archive contains source files for 6 charsets:
> > GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> > implementation of CharsetProvider; generator for other Charsets and
> > native part. I've tested the package with more that 90 charsets, and
> > all benchmarks and tests passed with new bundle. Additionally I have
> > significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> > current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> > for antlr and ~5-8x for xalan.
> >
> > The main advantages of the package are the following:
> >   - Code for every charset is generated by CharsetGenerator, thus, if
> > some modification would be necessary we need just correct generator
> > and re-generate all sources.
> >   - We use 2 different encoders and decoders for java and direct
> > buffers. Since most applications use java heap buffers, unlike
> > existing implementation it doesn't produce lots of native calls to
> > perform encoding/decoding operations on the java buffers those
> > significantly improving performance. This is the main reason why we
> > have such a significant boost for Dacapo.
> >   - Charset tables for encoding/decoding are stored in appropriate
> > classes.
> >
> > Since the package contains implementation for 6 charsets only,
> > documentations how to generate and build additional charsets you could
> > find in README file from contributed package.
> >
> > Please do not hesitate to contact me for more details.
> >
> > Thanks,
> > Vladimir.
> >
>
>
>
> --
> Best regards,
> Andrew Zhang
>

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Andrew Zhang <zh...@gmail.com>.
Super cool!!!
Does it mean we're not dependent on ICU any more?

On 4/9/07, Vladimir Strigun <vs...@gmail.com> wrote:
>
> Hi all!
>
> I'm happy to announce one more contribution to harmony on behalf of
> Intel. Provided implementation of charset encoders/decoders is
> intended to replace the ICU-based charsets encoding/decoding
> operations. The code was developed in clean-room environment inside
> Intel and I'd like you to play with it and include to current Harmony
> tree.
>
> The package could be found there:
> HARMONY-3593
>
> The algorithms for charsets encoding/decoding differs from that of
> ICU, all charsets are generated from current Harmony or any other
> implementation of Java and could be properly integrated into current
> nio_char module. The archive contains source files for 6 charsets:
> GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> implementation of CharsetProvider; generator for other Charsets and
> native part. I've tested the package with more that 90 charsets, and
> all benchmarks and tests passed with new bundle. Additionally I have
> significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> for antlr and ~5-8x for xalan.
>
> The main advantages of the package are the following:
>   - Code for every charset is generated by CharsetGenerator, thus, if
> some modification would be necessary we need just correct generator
> and re-generate all sources.
>   - We use 2 different encoders and decoders for java and direct
> buffers. Since most applications use java heap buffers, unlike
> existing implementation it doesn't produce lots of native calls to
> perform encoding/decoding operations on the java buffers those
> significantly improving performance. This is the main reason why we
> have such a significant boost for Dacapo.
>   - Charset tables for encoding/decoding are stored in appropriate
> classes.
>
> Since the package contains implementation for 6 charsets only,
> documentations how to generate and build additional charsets you could
> find in README file from contributed package.
>
> Please do not hesitate to contact me for more details.
>
> Thanks,
> Vladimir.
>



-- 
Best regards,
Andrew Zhang

Re: [contribution] Contribution of charset encoders/decoders for NIO_CHAR module

Posted by Stefano Mazzocchi <st...@apache.org>.
Vladimir Strigun wrote:
> Hi all!
> 
> I'm happy to announce one more contribution to harmony on behalf of
> Intel. Provided implementation of charset encoders/decoders is
> intended to replace the ICU-based charsets encoding/decoding
> operations. The code was developed in clean-room environment inside
> Intel and I'd like you to play with it and include to current Harmony
> tree.
> 
> The package could be found there:
> HARMONY-3593
> 
> The algorithms for charsets encoding/decoding differs from that of
> ICU, all charsets are generated from current Harmony or any other
> implementation of Java and could be properly integrated into current
> nio_char module. The archive contains source files for 6 charsets:
> GB18030, US-ASCII, ISO-8859-1, UTF-8, UTF-16, UTF-16BE, UTF-16LE;
> implementation of CharsetProvider; generator for other Charsets and
> native part. I've tested the package with more that 90 charsets, and
> all benchmarks and tests passed with new bundle. Additionally I have
> significant boost for Dacapo.antlr and Dacapo.xalan benchmarks with
> current Harmony tree on DRLVM and IBM VM. On DRLVM I have 2.5x boost
> for antlr and ~5-8x for xalan.
> 
> The main advantages of the package are the following:
>  - Code for every charset is generated by CharsetGenerator, thus, if
> some modification would be necessary we need just correct generator
> and re-generate all sources.
>  - We use 2 different encoders and decoders for java and direct
> buffers. Since most applications use java heap buffers, unlike
> existing implementation it doesn't produce lots of native calls to
> perform encoding/decoding operations on the java buffers those
> significantly improving performance. This is the main reason why we
> have such a significant boost for Dacapo.

wow, this is huge! Is there any significant change in the speed of
string creation too?

>  - Charset tables for encoding/decoding are stored in appropriate classes.
> 
> Since the package contains implementation for 6 charsets only,
> documentations how to generate and build additional charsets you could
> find in README file from contributed package.
> 
> Please do not hesitate to contact me for more details.

Thanks for the awesome contribution.

-- 
Stefano.