You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by "Safarnejad, Ali (AFIS)" <Al...@fao.org> on 2004/12/16 10:22:32 UTC

Aramorph Analyzer

I wanted to share some results from trying out Aramorph Arabic Analyzer with
Lucene.  I experimented with a set of 100 web documents in Windows-1256
encoding.  The indexing took just over 200 seconds, although I had to
increase the heap-size to 500Meg, or I would get OutOfMemory Exceptions
halfway thru the documents.  The 200 seconds includes time to make the url
connection and tidy the documents to extract the text out.

Has anyone done similar experiments with a larger set of Arabic documents?
I'm interested in hearing from anyone else who has used Aramorph with Lucene.

Thanks,
Ali

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org