You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Avlesh Singh <av...@gmail.com> on 2009/10/05 19:27:36 UTC

Re: wildcard searches

>
> No filters are applied to wildcard/fuzzy searches.
>
Ah! Not like that ..
I guess, it is just that the phrase searches using wildcards are not
analyzed.

Cheers
Avlesh

On Mon, Oct 5, 2009 at 10:42 PM, Christian Zambrano <cz...@gmail.com>wrote:

> No filters are applied to wildcard/fuzzy searches.
>
> I couldn't find a reference to this on either the solr or lucene
> documentation but I read it on the Solr book from PACKT
>
>
> On 10/05/2009 12:09 PM, Angel Ice wrote:
>
>> Hi everyone,
>>
>> I have a little question regarding the search engine when a wildcard
>> character is used in the query.
>> Let's take the following example :
>>
>> - I have sent in indexation the word Hésitation (with an accent on the
>> "e")
>> - The filters applied to the field that will handle this word, result in
>> the indexation of "esit" (the mute H is suppressed (home made filter), the
>> accent too (IsoLatin1Filter), and the SnowballPorterFilter suppress the
>> "ation".
>>
>> When i search for "hesitation", "esitation", "ésitation" etc ... all is
>> OK, the document is returned.
>> But as soon as I use a wildcard, like "hésita*", the document is not
>> returned. In fact, I have to put the wildcard in a manner that match the
>> indexed term exactly (example "esi*")
>>
>> Does the search engine applies the filters to the word that prefix the
>> wildcard ? Or does it use this prefix verbatim ?
>>
>> Thanks for you help.
>>
>> Laurent
>>
>

Re: wildcard searches

Posted by Christian Zambrano <cz...@gmail.com>.

On 10/05/2009 01:18 PM, Avlesh Singh wrote:
>> First of all, I know of no way of doing wildcard phrase queries.
>>
>>      
> http://wiki.apache.org/lucene-java/LuceneFAQ#Can_I_combine_wildcard_and_phrase_search.2C_e.g._.22foo_ba.2A.22.3F
>    
Thanks for that link
> When I said not filters, I meant TokenFilters which is what I believe you
>    
>> mean by 'not analyzed'
>>
>>      
> Analysis is a Lucene way of configuring tokenizers and filters for a field
> (index time and query time). I guess, both of us mean the same thing.
>    
You are correct. I should have said ' Not Analyzed'. Thanks for the 
correction.
> Cheers
> Avlesh
>
> On Mon, Oct 5, 2009 at 11:04 PM, Christian Zambrano<cz...@gmail.com>wrote:
>
>    
>> Avlesh, I don't understand your answer.
>>
>> First of all, I know of no way of doing wildcard phrase queries.
>>
>> When I said not filters, I meant TokenFilters which is what I believe you
>> mean by 'not analyzed'
>>
>>
>> On 10/05/2009 12:27 PM, Avlesh Singh wrote:
>>
>>      
>>> No filters are applied to wildcard/fuzzy searches.
>>>        
>>>>
>>>>
>>>>          
>>> Ah! Not like that ..
>>> I guess, it is just that the phrase searches using wildcards are not
>>> analyzed.
>>>
>>> Cheers
>>> Avlesh
>>>
>>> On Mon, Oct 5, 2009 at 10:42 PM, Christian Zambrano<czambran@gmail.com
>>>        
>>>> wrote:
>>>>          
>>>
>>>
>>>        
>>>> No filters are applied to wildcard/fuzzy searches.
>>>>
>>>> I couldn't find a reference to this on either the solr or lucene
>>>> documentation but I read it on the Solr book from PACKT
>>>>
>>>>
>>>> On 10/05/2009 12:09 PM, Angel Ice wrote:
>>>>
>>>>
>>>>
>>>>          
>>>>> Hi everyone,
>>>>>
>>>>> I have a little question regarding the search engine when a wildcard
>>>>> character is used in the query.
>>>>> Let's take the following example :
>>>>>
>>>>> - I have sent in indexation the word Hésitation (with an accent on the
>>>>> "e")
>>>>> - The filters applied to the field that will handle this word, result in
>>>>> the indexation of "esit" (the mute H is suppressed (home made filter),
>>>>> the
>>>>> accent too (IsoLatin1Filter), and the SnowballPorterFilter suppress the
>>>>> "ation".
>>>>>
>>>>> When i search for "hesitation", "esitation", "ésitation" etc ... all is
>>>>> OK, the document is returned.
>>>>> But as soon as I use a wildcard, like "hésita*", the document is not
>>>>> returned. In fact, I have to put the wildcard in a manner that match the
>>>>> indexed term exactly (example "esi*")
>>>>>
>>>>> Does the search engine applies the filters to the word that prefix the
>>>>> wildcard ? Or does it use this prefix verbatim ?
>>>>>
>>>>> Thanks for you help.
>>>>>
>>>>> Laurent
>>>>>
>>>>>
>>>>>
>>>>>            
>>>>
>>>>          
>>>
>>>        
>>      
>    

Re: wildcard searches

Posted by Avlesh Singh <av...@gmail.com>.
>
> First of all, I know of no way of doing wildcard phrase queries.
>
http://wiki.apache.org/lucene-java/LuceneFAQ#Can_I_combine_wildcard_and_phrase_search.2C_e.g._.22foo_ba.2A.22.3F

When I said not filters, I meant TokenFilters which is what I believe you
> mean by 'not analyzed'
>
Analysis is a Lucene way of configuring tokenizers and filters for a field
(index time and query time). I guess, both of us mean the same thing.

Cheers
Avlesh

On Mon, Oct 5, 2009 at 11:04 PM, Christian Zambrano <cz...@gmail.com>wrote:

> Avlesh, I don't understand your answer.
>
> First of all, I know of no way of doing wildcard phrase queries.
>
> When I said not filters, I meant TokenFilters which is what I believe you
> mean by 'not analyzed'
>
>
> On 10/05/2009 12:27 PM, Avlesh Singh wrote:
>
>> No filters are applied to wildcard/fuzzy searches.
>>>
>>>
>>>
>> Ah! Not like that ..
>> I guess, it is just that the phrase searches using wildcards are not
>> analyzed.
>>
>> Cheers
>> Avlesh
>>
>> On Mon, Oct 5, 2009 at 10:42 PM, Christian Zambrano<czambran@gmail.com
>> >wrote:
>>
>>
>>
>>> No filters are applied to wildcard/fuzzy searches.
>>>
>>> I couldn't find a reference to this on either the solr or lucene
>>> documentation but I read it on the Solr book from PACKT
>>>
>>>
>>> On 10/05/2009 12:09 PM, Angel Ice wrote:
>>>
>>>
>>>
>>>> Hi everyone,
>>>>
>>>> I have a little question regarding the search engine when a wildcard
>>>> character is used in the query.
>>>> Let's take the following example :
>>>>
>>>> - I have sent in indexation the word Hésitation (with an accent on the
>>>> "e")
>>>> - The filters applied to the field that will handle this word, result in
>>>> the indexation of "esit" (the mute H is suppressed (home made filter),
>>>> the
>>>> accent too (IsoLatin1Filter), and the SnowballPorterFilter suppress the
>>>> "ation".
>>>>
>>>> When i search for "hesitation", "esitation", "ésitation" etc ... all is
>>>> OK, the document is returned.
>>>> But as soon as I use a wildcard, like "hésita*", the document is not
>>>> returned. In fact, I have to put the wildcard in a manner that match the
>>>> indexed term exactly (example "esi*")
>>>>
>>>> Does the search engine applies the filters to the word that prefix the
>>>> wildcard ? Or does it use this prefix verbatim ?
>>>>
>>>> Thanks for you help.
>>>>
>>>> Laurent
>>>>
>>>>
>>>>
>>>
>>>
>>
>>
>

Re: wildcard searches

Posted by Christian Zambrano <cz...@gmail.com>.
Avlesh, I don't understand your answer.

First of all, I know of no way of doing wildcard phrase queries.

When I said not filters, I meant TokenFilters which is what I believe 
you mean by 'not analyzed'

On 10/05/2009 12:27 PM, Avlesh Singh wrote:
>> No filters are applied to wildcard/fuzzy searches.
>>
>>      
> Ah! Not like that ..
> I guess, it is just that the phrase searches using wildcards are not
> analyzed.
>
> Cheers
> Avlesh
>
> On Mon, Oct 5, 2009 at 10:42 PM, Christian Zambrano<cz...@gmail.com>wrote:
>
>    
>> No filters are applied to wildcard/fuzzy searches.
>>
>> I couldn't find a reference to this on either the solr or lucene
>> documentation but I read it on the Solr book from PACKT
>>
>>
>> On 10/05/2009 12:09 PM, Angel Ice wrote:
>>
>>      
>>> Hi everyone,
>>>
>>> I have a little question regarding the search engine when a wildcard
>>> character is used in the query.
>>> Let's take the following example :
>>>
>>> - I have sent in indexation the word Hésitation (with an accent on the
>>> "e")
>>> - The filters applied to the field that will handle this word, result in
>>> the indexation of "esit" (the mute H is suppressed (home made filter), the
>>> accent too (IsoLatin1Filter), and the SnowballPorterFilter suppress the
>>> "ation".
>>>
>>> When i search for "hesitation", "esitation", "ésitation" etc ... all is
>>> OK, the document is returned.
>>> But as soon as I use a wildcard, like "hésita*", the document is not
>>> returned. In fact, I have to put the wildcard in a manner that match the
>>> indexed term exactly (example "esi*")
>>>
>>> Does the search engine applies the filters to the word that prefix the
>>> wildcard ? Or does it use this prefix verbatim ?
>>>
>>> Thanks for you help.
>>>
>>> Laurent
>>>
>>>        
>>      
>