Posted to solr-user@lucene.apache.org by Jorge Luis Betancourt Gonzalez <jl...@uci.cu> on 2012/11/26 20:54:45 UTC

Suggester with punctuation signs

Hi:

I've configured my Solr setup to use the suggester component, and I fetch term suggestions from a PHP application. The problem is that I'm getting results like 'universidad,' (note the trailing comma). Is there any way to get rid of the punctuation? Or do I need to create a separate field and strip all punctuation from it?

Greetings

10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS INFORMATICAS...
CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION

http://www.uci.cu
http://www.facebook.com/universidad.uci
http://www.flickr.com/photos/universidad_uci


Re: Suggester with punctuation signs

Posted by Jorge Luis Betancourt Gonzalez <jl...@uci.cu>.
Hi Upayavira:

I'm using the standard tokenizer now and it's working fine, but I was wondering whether there is any way to strip these punctuation marks directly in the suggest requestHandler, so that I don't need to reindex. I've been running some tests, and increasing the threshold has improved the accuracy of the suggestions. One more thing: the suggestions are mainly in Spanish, so is there a "best practice" configuration for this, or will a standard configuration do the trick?

Thanks!
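
As a stop-gap that avoids reindexing, the punctuation could also be trimmed on the client after the suggestions come back. The sketch below is only an illustration in Python (the application in this thread is PHP); the function name `clean_suggestions` and the sample terms are hypothetical, and it assumes the suggestions have already been parsed into a list of strings. It does not change what Solr returns, it only tidies the list before display.

```python
import string

# Characters to trim from the edges of a suggestion. The Spanish inverted
# marks and guillemets are added because string.punctuation is ASCII-only.
EDGE_PUNCTUATION = string.punctuation + "¿¡«»"


def clean_suggestions(suggestions):
    """Trim edge punctuation and drop duplicates, keeping first-seen order."""
    seen = set()
    cleaned = []
    for term in suggestions:
        stripped = term.strip(EDGE_PUNCTUATION + string.whitespace)
        # Skip terms that were only punctuation, and terms already emitted.
        if stripped and stripped not in seen:
            seen.add(stripped)
            cleaned.append(stripped)
    return cleaned


if __name__ == "__main__":
    raw = ["universidad,", "universidad", "¿universo?", "unidad."]
    print(clean_suggestions(raw))  # ['universidad', 'universo', 'unidad']
```

Note that this only hides the symptom: the index still contains the punctuated terms, so fixing the analysis chain and reindexing remains the cleaner solution.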

On Nov 26, 2012, at 6:18 PM, Upayavira <uv...@odoko.co.uk> wrote:

> You may want to change your tokenisation anyhow, as a search for
> 'universidad' will not match your term 'universidad,'
> 
> But you are on the right track - to improve suggestions, improve what is
> in your index.
> 
> Upayavira

Re: Suggester with punctuation signs

Posted by Upayavira <uv...@odoko.co.uk>.
You may want to change your tokenisation anyhow, as a search for
'universidad' will not match your term 'universidad,'

But you are on the right track - to improve suggestions, improve what is
in your index.

Upayavira
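
The mismatch described above can be illustrated with a small Python sketch. It is not Solr's analysis code: `whitespace_tokens` and `word_tokens` are hypothetical stand-ins for a whitespace-only tokenizer and for a tokenizer that drops punctuation, and the sample sentence is made up.

```python
import re


def whitespace_tokens(text):
    """Split on whitespace only, so punctuation stays attached to words."""
    return text.lower().split()


def word_tokens(text):
    """Keep runs of word characters only, which discards punctuation."""
    return re.findall(r"\w+", text.lower())


if __name__ == "__main__":
    doc = "Bienvenidos a la universidad, estudiantes."
    query = "universidad"

    # With whitespace splitting the indexed term is 'universidad,' so an
    # exact term match on 'universidad' fails.
    print(whitespace_tokens(doc))
    print(query in whitespace_tokens(doc))  # False

    # Once punctuation is dropped at analysis time, the terms line up.
    print(word_tokens(doc))
    print(query in word_tokens(doc))  # True
```

The same terms feed both search and the suggester, which is why cleaning up the indexed tokens improves both at once.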

On Mon, Nov 26, 2012, at 07:54 PM, Jorge Luis Betancourt Gonzalez wrote:
> Hi:
> 
> I've configured my solr setup to use the suggester component and to get
> terms suggestions from a PHP application, the thing is that I'm getting
> results like universidad, note the punctuation sign, is there any way I
> can get rid of this? Or do I need to create a separate field and strip
> all punctuation signs?.
> 
> Greetings