You are viewing a plain text version of this content. The canonical link for it is here.

Posted to solr-user@lucene.apache.org by Joe Calderon <ca...@gmail.com> on 2009/12/15 21:30:00 UTC

wildcard oddity

im trying to do a wild card search

"q":"item_title:(gets*)"    returns no results
"q":"item_title:(gets)"    returns results
"q":"item_title:(get*)"    returns results


seems like * at the end of a token is requiring a character, instead
of being 0 or more its acting like1 or more

the text im trying to match is "The Gang Gets Extreme: Home Makeover Edition"

the field uses the following analyzers

    <fieldType name="text_token" class="solr.TextField"
positionIncrementGap="100" omitNorms="false">
      <analyzer>
        <charFilter class="solr.HTMLStripCharFilterFactory" />
        <tokenizer class="solr.WhiteSpaceTokenizerFactory" />
        <filter class="solr.LowerCaseFilterFactory" />
        <filter class="solr.ISOLatin1AccentFilterFactory" />
        <filter class="solr.WordDelimiterFilterFactory"
generateWordParts="1" generateNumberParts="0" catenateAll="1"
splitOnNumerics="0" splitOnCaseChange="0" stemEnglishPossessive="0" />
      </analyzer>
    </fieldType>


is anybody else having similar problems?


best,
--joe

Re: wildcard oddity

Posted by Erick Erickson <er...@gmail.com>.

Do you get the same behavior if you search for "gang" instead of "gets"?
I'm wondering if there's something going on with stemEnglishPossesive.

According to the docs you *should* be OK since you set
setmEnglishPosessive=0,
but this would help point in the right direction.

Also, am I correct in assuming that that is the analyzer both for indexing
AND searching?

Best
Erick

On Tue, Dec 15, 2009 at 3:30 PM, Joe Calderon <ca...@gmail.com>wrote:

> im trying to do a wild card search
>
> "q":"item_title:(gets*)"    returns no results
> "q":"item_title:(gets)"    returns results
> "q":"item_title:(get*)"    returns results
>
>
> seems like * at the end of a token is requiring a character, instead
> of being 0 or more its acting like1 or more
>
> the text im trying to match is "The Gang Gets Extreme: Home Makeover
> Edition"
>
> the field uses the following analyzers
>
>    <fieldType name="text_token" class="solr.TextField"
> positionIncrementGap="100" omitNorms="false">
>      <analyzer>
>        <charFilter class="solr.HTMLStripCharFilterFactory" />
>        <tokenizer class="solr.WhiteSpaceTokenizerFactory" />
>        <filter class="solr.LowerCaseFilterFactory" />
>        <filter class="solr.ISOLatin1AccentFilterFactory" />
>        <filter class="solr.WordDelimiterFilterFactory"
> generateWordParts="1" generateNumberParts="0" catenateAll="1"
> splitOnNumerics="0" splitOnCaseChange="0" stemEnglishPossessive="0" />
>      </analyzer>
>    </fieldType>
>
>
> is anybody else having similar problems?
>
>
> best,
> --joe
>