You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by Dawid Weiss <da...@gmail.com> on 2013/03/22 08:55:06 UTC

Fastest UFT-8 -> UNICODE converter as state machine.

Something for automata freaks ;) Don't know if it's going to be faster than
Java's intrinsics (with the bounds checking and backbuffer referencing) but
it's definitely elegant, huh? :)

Dawid

---------- Forwarded message ----------
From: Alexey Utkin <al...@oracle.com>
Date: Fri, Mar 22, 2013 at 8:06 AM
Subject: FYI: Fastest UFT-8 -> UNICODE converter as state machine.
To: Alan Bateman <Al...@oracle.com>, "JCG-SPB_RU@oracle.com" <
JCG-SPB_RU@oracle.com>, core-libs-dev <co...@openjdk.java.net>


http://bjoern.hoehrmann.de/**utf-8/decoder/dfa/<http://bjoern.hoehrmann.de/utf-8/decoder/dfa/>
Verified on full UNICODE set.

Regards,
-uta

Re: Fastest UFT-8 -> UNICODE converter as state machine.

Posted by Simon Willnauer <si...@gmail.com>.
nice one!

On Fri, Mar 22, 2013 at 8:55 AM, Dawid Weiss <da...@gmail.com> wrote:
>
> Something for automata freaks ;) Don't know if it's going to be faster than
> Java's intrinsics (with the bounds checking and backbuffer referencing) but
> it's definitely elegant, huh? :)
>
> Dawid
>
> ---------- Forwarded message ----------
> From: Alexey Utkin <al...@oracle.com>
> Date: Fri, Mar 22, 2013 at 8:06 AM
> Subject: FYI: Fastest UFT-8 -> UNICODE converter as state machine.
> To: Alan Bateman <Al...@oracle.com>, "JCG-SPB_RU@oracle.com"
> <JC...@oracle.com>, core-libs-dev <co...@openjdk.java.net>
>
>
> http://bjoern.hoehrmann.de/utf-8/decoder/dfa/
> Verified on full UNICODE set.
>
> Regards,
> -uta
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Re: Fastest UFT-8 -> UNICODE converter as state machine.

Posted by Michael McCandless <lu...@mikemccandless.com>.
Cool :)

Mike McCandless

http://blog.mikemccandless.com


On Fri, Mar 22, 2013 at 3:55 AM, Dawid Weiss <da...@gmail.com> wrote:
>
> Something for automata freaks ;) Don't know if it's going to be faster than
> Java's intrinsics (with the bounds checking and backbuffer referencing) but
> it's definitely elegant, huh? :)
>
> Dawid
>
> ---------- Forwarded message ----------
> From: Alexey Utkin <al...@oracle.com>
> Date: Fri, Mar 22, 2013 at 8:06 AM
> Subject: FYI: Fastest UFT-8 -> UNICODE converter as state machine.
> To: Alan Bateman <Al...@oracle.com>, "JCG-SPB_RU@oracle.com"
> <JC...@oracle.com>, core-libs-dev <co...@openjdk.java.net>
>
>
> http://bjoern.hoehrmann.de/utf-8/decoder/dfa/
> Verified on full UNICODE set.
>
> Regards,
> -uta
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org