You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Pablo Queixalos <pa...@polyspot.com> on 2011/08/12 10:28:16 UTC

Clustering not working when using 'text' field as snippet.

Hi,

 

 

I am using solr-3.3.0 and carrot² clustering which works fine out of the box with the examples doc and default solr configuration (the 'features' Field is used as snippet).

 

I indexed my own documents using the embed ExtractingRequestHandler wich by default stores contents in the 'text' Field. When configuring clustering on 'text' as snippet, carrot doesn't work fine and only shows 'Other topics' with all the documents within. It looks like carrot doesn't get the 'text' Field stored content.

 

 

If I store the documents content in the 'features' field and get back to the original configuration clustering works fine.

 

The only difference I see between 'text' and 'features' Fields in schema.xml is that some CopyFields are defined for 'text'.

 

 

I didn't debug solr.clustering.ClusteringComponent nor CarrotClusteringEngine yet, but am I misunderstanding something about the 'text' Field ? 

 

 

Thanks,

 

Pablo.


RE: Clustering not working when using 'text' field as snippet.

Posted by Pablo Queixalos <pa...@polyspot.com>.
Thanks for your reply Staszek,


Of course, the field has to be stored. I forgot to mention that I already updated the schema for that. I also checked that data was effectiveley stored in that field. 

Anyway, I tried to reproduce it on a fresh Solr install and clustering works well. ;-)


Pablo.

-----Message d'origine-----
De : stachoo@gmail.com [mailto:stachoo@gmail.com] De la part de Stanislaw Osinski
Envoyé : vendredi 12 août 2011 11:00
À : solr-user@lucene.apache.org
Objet : Re: Clustering not working when using 'text' field as snippet.

Hi Pablo,

The reason clustering doesn't work with the "text" field is that the field is not stored:

 <field name="text" type="text_general" indexed="true" stored="false"
multiValued="true"/>

For clustering to work, you'll need to keep your documents' titles and content in stored fields.

Staszek


On Fri, Aug 12, 2011 at 10:28, Pablo Queixalos <pablo.queixalos@polyspot.com
> wrote:

> Hi,
>
>
>
>
>
> I am using solr-3.3.0 and carrot² clustering which works fine out of 
> the box with the examples doc and default solr configuration (the 'features'
> Field is used as snippet).
>
>
>
> I indexed my own documents using the embed ExtractingRequestHandler 
> wich by default stores contents in the 'text' Field. When configuring 
> clustering on 'text' as snippet, carrot doesn't work fine and only shows 'Other topics'
> with all the documents within. It looks like carrot doesn't get the 'text'
> Field stored content.
>
>
>
>
>
> If I store the documents content in the 'features' field and get back 
> to the original configuration clustering works fine.
>
>
>
> The only difference I see between 'text' and 'features' Fields in 
> schema.xml is that some CopyFields are defined for 'text'.
>
>
>
>
>
> I didn't debug solr.clustering.ClusteringComponent nor 
> CarrotClusteringEngine yet, but am I misunderstanding something about 
> the 'text' Field ?
>
>
>
>
>
> Thanks,
>
>
>
> Pablo.
>
>

Re: Clustering not working when using 'text' field as snippet.

Posted by Stanislaw Osinski <st...@carrotsearch.com>.
Hi Pablo,

The reason clustering doesn't work with the "text" field is that the field
is not stored:

 <field name="text" type="text_general" indexed="true" stored="false"
multiValued="true"/>

For clustering to work, you'll need to keep your documents' titles and
content in stored fields.

Staszek


On Fri, Aug 12, 2011 at 10:28, Pablo Queixalos <pablo.queixalos@polyspot.com
> wrote:

> Hi,
>
>
>
>
>
> I am using solr-3.3.0 and carrot² clustering which works fine out of the
> box with the examples doc and default solr configuration (the 'features'
> Field is used as snippet).
>
>
>
> I indexed my own documents using the embed ExtractingRequestHandler wich by
> default stores contents in the 'text' Field. When configuring clustering on
> 'text' as snippet, carrot doesn't work fine and only shows 'Other topics'
> with all the documents within. It looks like carrot doesn't get the 'text'
> Field stored content.
>
>
>
>
>
> If I store the documents content in the 'features' field and get back to
> the original configuration clustering works fine.
>
>
>
> The only difference I see between 'text' and 'features' Fields in
> schema.xml is that some CopyFields are defined for 'text'.
>
>
>
>
>
> I didn't debug solr.clustering.ClusteringComponent nor
> CarrotClusteringEngine yet, but am I misunderstanding something about the
> 'text' Field ?
>
>
>
>
>
> Thanks,
>
>
>
> Pablo.
>
>