You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Jorge Luis Betancourt Gonzalez <jl...@uci.cu> on 2012/11/26 20:54:45 UTC
Suggester with punctuation signs
Hi:
I've configured my solr setup to use the suggester component and to get terms suggestions from a PHP application, the thing is that I'm getting results like universidad, note the punctuation sign, is there any way I can get rid of this? Or do I need to create a separate field and strip all punctuation signs?.
Greetings
10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS INFORMATICAS...
CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION
http://www.uci.cu
http://www.facebook.com/universidad.uci
http://www.flickr.com/photos/universidad_uci
Re: Suggester with punctuation signs
Posted by Jorge Luis Betancourt Gonzalez <jl...@uci.cu>.
Hi! Upayavira:
Hi I'm using the standard tokenizer right now, and it's working fine, but I was wondering if there is any form I can strip this punctuation marks right in the suggest requestHandler, so no need for index again. I've been doing some tests and increasing the threshold has improved the accuracy of the suggestions, one more thing is that the suggestions are mainly in spanish, so, any "best practice" configuration for this? or any standard configuration will do the trick?
Thanks!
On Nov 26, 2012, at 6:18 PM, Upayavira <uv...@odoko.co.uk> wrote:
> You may want to change your tokenisation anyhow, as a search for
> 'universidad' will not match your term 'universidad,'
>
> But you are on the right track - to improve suggestions, improve what is
> in your index.
>
> Upayavira
>
> On Mon, Nov 26, 2012, at 07:54 PM, Jorge Luis Betancourt Gonzalez wrote:
>> Hi:
>>
>> I've configured my solr setup to use the suggester component and to get
>> terms suggestions from a PHP application, the thing is that I'm getting
>> results like universidad, note the punctuation sign, is there any way I
>> can get rid of this? Or do I need to create a separate field and strip
>> all punctuation signs?.
>>
>> Greetings
>>
>> 10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS
>> INFORMATICAS...
>> CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION
>>
>> http://www.uci.cu
>> http://www.facebook.com/universidad.uci
>> http://www.flickr.com/photos/universidad_uci
>>
>
> 10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS INFORMATICAS...
> CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION
>
> http://www.uci.cu
> http://www.facebook.com/universidad.uci
> http://www.flickr.com/photos/universidad_uci
10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS INFORMATICAS...
CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION
http://www.uci.cu
http://www.facebook.com/universidad.uci
http://www.flickr.com/photos/universidad_uci
Re: Suggester with punctuation signs
Posted by Upayavira <uv...@odoko.co.uk>.
You may want to change your tokenisation anyhow, as a search for
'universidad' will not match your term 'universidad,'
But you are on the right track - to improve suggestions, improve what is
in your index.
Upayavira
On Mon, Nov 26, 2012, at 07:54 PM, Jorge Luis Betancourt Gonzalez wrote:
> Hi:
>
> I've configured my solr setup to use the suggester component and to get
> terms suggestions from a PHP application, the thing is that I'm getting
> results like universidad, note the punctuation sign, is there any way I
> can get rid of this? Or do I need to create a separate field and strip
> all punctuation signs?.
>
> Greetings
>
> 10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS
> INFORMATICAS...
> CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION
>
> http://www.uci.cu
> http://www.facebook.com/universidad.uci
> http://www.flickr.com/photos/universidad_uci
>