You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Mike Hugo <mi...@piragua.com> on 2012/02/15 23:07:14 UTC

Size of suggest dictionary

Hello,

We're building an auto suggest component based on the "label" field of
documents.  Is there a way to see how many terms are in the dictionary, or
how much memory it's taking up?  I looked on the statistics page but didn't
find anything obvious.

Thanks in advance,

Mike

ps- here's the config:

    <searchComponent name="suggestlabel" class="solr.SpellCheckComponent">
        <lst name="spellchecker">
            <str name="name">suggestlabel</str>
            <str
name="classname">org.apache.solr.spelling.suggest.Suggester</str>
            <str
name="lookupImpl">org.apache.solr.spelling.suggest.tst.TSTLookup</str>
            <str name="field">label</str>
            <str name="buildOnOptimize">true</str>
        </lst>
    </searchComponent>

    <requestHandler name="suggestlabel"
class="org.apache.solr.handler.component.SearchHandler">
        <lst name="defaults">
            <str name="spellcheck">true</str>
            <str name="spellcheck.dictionary">suggestlabel</str>
            <str name="spellcheck.count">10</str>
        </lst>
        <arr name="components">
            <str>suggestlabel</str>
        </arr>
    </requestHandler>

Re: Size of suggest dictionary

Posted by Mike Hugo <mi...@piragua.com>.
Thanks Em!

What if we use a threshold value in the suggest configuration, like 

      <float name="threshold">0.005</float>

I assume the dictionary size will then be smaller than the total number of distinct terms, is there anyway to determine what that size is?

Thanks,

Mike


On Wednesday, February 15, 2012 at 4:39 PM, Em wrote:

> Hello Mike,
> 
> have a look at Solr's Schema Browser. Click on "FIELDS", select "label"
> and have a look at the number of distinct (term-)values.
> 
> Regards,
> Em
> 
> 
> Am 15.02.2012 23:07, schrieb Mike Hugo:
> > Hello,
> > 
> > We're building an auto suggest component based on the "label" field of
> > documents. Is there a way to see how many terms are in the dictionary, or
> > how much memory it's taking up? I looked on the statistics page but didn't
> > find anything obvious.
> > 
> > Thanks in advance,
> > 
> > Mike
> > 
> > ps- here's the config:
> > 
> > <searchComponent name="suggestlabel" class="solr.SpellCheckComponent">
> > <lst name="spellchecker">
> > <str name="name">suggestlabel</str>
> > <str
> > name="classname">org.apache.solr.spelling.suggest.Suggester</str>
> > <str
> > name="lookupImpl">org.apache.solr.spelling.suggest.tst.TSTLookup</str>
> > <str name="field">label</str>
> > <str name="buildOnOptimize">true</str>
> > </lst>
> > </searchComponent>
> > 
> > <requestHandler name="suggestlabel"
> > class="org.apache.solr.handler.component.SearchHandler">
> > <lst name="defaults">
> > <str name="spellcheck">true</str>
> > <str name="spellcheck.dictionary">suggestlabel</str>
> > <str name="spellcheck.count">10</str>
> > </lst>
> > <arr name="components">
> > <str>suggestlabel</str>
> > </arr>
> > </requestHandler>
> > 
> 
> 
> 



Re: Size of suggest dictionary

Posted by Em <ma...@yahoo.de>.
Hello Mike,

have a look at Solr's Schema Browser. Click on "FIELDS", select "label"
and have a look at the number of distinct (term-)values.

Regards,
Em


Am 15.02.2012 23:07, schrieb Mike Hugo:
> Hello,
> 
> We're building an auto suggest component based on the "label" field of
> documents.  Is there a way to see how many terms are in the dictionary, or
> how much memory it's taking up?  I looked on the statistics page but didn't
> find anything obvious.
> 
> Thanks in advance,
> 
> Mike
> 
> ps- here's the config:
> 
>     <searchComponent name="suggestlabel" class="solr.SpellCheckComponent">
>         <lst name="spellchecker">
>             <str name="name">suggestlabel</str>
>             <str
> name="classname">org.apache.solr.spelling.suggest.Suggester</str>
>             <str
> name="lookupImpl">org.apache.solr.spelling.suggest.tst.TSTLookup</str>
>             <str name="field">label</str>
>             <str name="buildOnOptimize">true</str>
>         </lst>
>     </searchComponent>
> 
>     <requestHandler name="suggestlabel"
> class="org.apache.solr.handler.component.SearchHandler">
>         <lst name="defaults">
>             <str name="spellcheck">true</str>
>             <str name="spellcheck.dictionary">suggestlabel</str>
>             <str name="spellcheck.count">10</str>
>         </lst>
>         <arr name="components">
>             <str>suggestlabel</str>
>         </arr>
>     </requestHandler>
>