Posted to users@jackrabbit.apache.org by Smart Pigeon <zw...@yahoo.com> on 2008/05/19 22:39:54 UTC
Does Jackrabbit support right-to-left languages such as Hebrew?
I tried to search some Hebrew docs and it seems it does not work. Can
anyone tell me if Jackrabbit supports the right-to-left languages?
--
View this message in context: http://www.nabble.com/Does-Jackrabbit-support-right-to-left-languages-such-as-Hebrew--tp17327609p17327609.html
Sent from the Jackrabbit - Users mailing list archive at Nabble.com.
RE: Does Jackrabbit support right-to-left languages such as Hebrew?
Posted by Ard Schrijvers <a....@onehippo.com>.
Hello Jukka et al,
> Hi,
>
> That's a bit trickier as Jackrabbit only uses a single Lucene
> Analyzer for a workspace. You should be fine if you don't
> mind using something like the StandardAnalyzer for all your
> content. The drawback is that the index won't then deal as
> well with non-English synonyms, stop words, plurals, etc.
>
> It might be possible, though probably quite tricky, to make
> the Analyzer configurable per node (or subtree), so you may
> want to file a feature request for that.
This feature was added quite some time ago; see JCR-1079. See [1] at
the bottom for how to configure a different analyzer per property.
-Ard
[1] http://wiki.apache.org/jackrabbit/IndexingConfiguration
>
> BR,
>
> Jukka Zitting
>
Re: Does Jackrabbit support right-to-left languages such as Hebrew?
Posted by Jukka Zitting <ju...@gmail.com>.
Hi,
On Wed, May 21, 2008 at 9:37 AM, Smart Pigeon <zw...@yahoo.com> wrote:
> Could you kindly tell me how to do this kind of configuration? Can we
> have both left-to-right and right-to-left language docs in the same
> repository
Yes, that should be no problem.
> and if so, can we formulate queries that can search for a phrase in
> either language?
That's a bit trickier as Jackrabbit only uses a single Lucene Analyzer
for a workspace. You should be fine if you don't mind using something
like the StandardAnalyzer for all your content. The drawback is that
the index won't then deal as well with non-English synonyms, stop
words, plurals, etc.
It might be possible, though probably quite tricky, to make the
Analyzer configurable per node (or subtree), so you may want to file a
feature request for that.
BR,
Jukka Zitting
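
As a rough illustration of the phrase search asked about above, here is a minimal sketch that builds a JCR 1.0 XPath statement using jcr:contains with the phrase in double quotes. The class and method names (ContainsPhraseQuery, forPhrase) are hypothetical and not part of Jackrabbit. The sketch deliberately rejects quotes and backslashes rather than guessing at escaping rules. Whether a given phrase actually matches depends on the analyzer used to index the content.

```java
// Hypothetical helper, not part of Jackrabbit: builds an XPath full-text
// phrase query string. The same code path serves Hebrew and English phrases,
// because the statement is just a Unicode string.
final class ContainsPhraseQuery {
    private ContainsPhraseQuery() {}

    /**
     * Returns an XPath statement matching nodes whose full-text index
     * contains the given phrase. Quotes and backslashes are rejected
     * (a simplification of this sketch, to avoid escaping questions).
     */
    static String forPhrase(String phrase) {
        if (phrase.isEmpty()
                || phrase.indexOf('"') >= 0
                || phrase.indexOf('\'') >= 0
                || phrase.indexOf('\\') >= 0) {
            throw new IllegalArgumentException("unsupported phrase: " + phrase);
        }
        // Double quotes inside the search expression request a phrase match.
        return "//*[jcr:contains(., '\"" + phrase + "\"')]";
    }

    public static void main(String[] args) {
        // An English phrase and a Hebrew phrase (written as Unicode escapes).
        System.out.println(forPhrase("right to left"));
        System.out.println(forPhrase("\u05E9\u05DC\u05D5\u05DD \u05E2\u05D5\u05DC\u05DD"));
    }
}
```

The resulting string would typically be passed to QueryManager.createQuery(statement, Query.XPATH) from javax.jcr.query; that step needs a live repository session and is left out here.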
Re: Does Jackrabbit support right-to-left languages such as Hebrew?
Posted by Smart Pigeon <zw...@yahoo.com>.
Thank you very much for your reply.
Could you kindly tell me how to do this kind of configuration? Can we
have both left-to-right and right-to-left language docs in the same
repository and if so, can we formulate queries that can search for a
phrase in either language?
Jukka Zitting wrote:
>
> Hi,
>
> On Mon, May 19, 2008 at 11:39 PM, Smart Pigeon <zw...@yahoo.com> wrote:
>> I tried to search some Hebrew docs and it seems it does not work. Can
>> anyone tell me if Jackrabbit supports the right-to-left languages?
>
> Jackrabbit is mostly language-independent, as all strings and names
> are stored in Unicode. The only part that's somewhat
> language-dependent is the full text indexing, but you can reconfigure
> the index if the standard analyzer doesn't work well enough for your
> language.
>
> BR,
>
> Jukka Zitting
>
>
Re: Does Jackrabbit support right-to-left languages such as Hebrew?
Posted by Jukka Zitting <ju...@gmail.com>.
Hi,
On Mon, May 19, 2008 at 11:39 PM, Smart Pigeon <zw...@yahoo.com> wrote:
> I tried to search some Hebrew docs and it seems it does not work. Can
> anyone tell me if Jackrabbit supports the right-to-left languages?
Jackrabbit is mostly language-independent, as all strings and names
are stored in Unicode. The only part that's somewhat
language-dependent is the full text indexing, but you can reconfigure
the index if the standard analyzer doesn't work well enough for your
language.
BR,
Jukka Zitting
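
To illustrate the point about Unicode storage, here is a small sketch using only the Java standard library, with no Jackrabbit calls. The class and method names (RtlTextDemo, startsRightToLeft, survivesUtf8RoundTrip) are made up for this example. It shows that Hebrew text is an ordinary sequence of code points kept in logical order, and that "right-to-left" is a property of the characters used when rendering, not of how the string is stored.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class: plain JDK only, no Jackrabbit API involved.
final class RtlTextDemo {
    private RtlTextDemo() {}

    /** True if the first strongly directional character is right-to-left. */
    static boolean startsRightToLeft(String text) {
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            byte dir = Character.getDirectionality(cp);
            if (dir == Character.DIRECTIONALITY_RIGHT_TO_LEFT
                    || dir == Character.DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC) {
                return true;
            }
            if (dir == Character.DIRECTIONALITY_LEFT_TO_RIGHT) {
                return false;
            }
            // Digits, spaces and punctuation are not strongly directional; skip them.
            i += Character.charCount(cp);
        }
        return false;
    }

    /** True if the text is unchanged after encoding to UTF-8 and decoding again. */
    static boolean survivesUtf8RoundTrip(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8);
        return text.equals(new String(bytes, StandardCharsets.UTF_8));
    }

    public static void main(String[] args) {
        String hebrew = "\u05E9\u05DC\u05D5\u05DD"; // the Hebrew word "shalom"
        System.out.println(startsRightToLeft(hebrew));
        System.out.println(startsRightToLeft("hello"));
        System.out.println(survivesUtf8RoundTrip(hebrew));
    }
}
```

Storage and retrieval treat such strings like any others; the language-dependent part remains the analyzer used for full text indexing, as described above.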