You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by hariram ravichandran <ha...@gmail.com> on 2017/01/23 13:59:26 UTC

Decompose compound words

I'm using  Lucene 4.10.4

I would like that if a document contains the word *"* *cheeseburger "* and
the search for *" cheese burger " or " cheese " or " **burger "* should
bring this document.

I tried *" DictionaryCompoundWordTokenFilter ".  * But it uses a
brute-force algorithm. And the words to match against is passed as word
dictionary externally.

Is there any way to match against the indexed words and to do the above in
more efficient way than the " *DictionaryCompoundWordTokenFilter *"