Posted to solr-user@lucene.apache.org by Martí Quixal <ma...@gmail.com> on 2012/11/21 21:00:54 UTC

Using payloads to encode part-of-speech in Solr 4.0.0

Dear list members,

I am trying to figure out how to configure schema.xml in Solr 4.0.0 so that
part-of-speech (PoS) tags are taken into account both when indexing
documents and when filtering queries, with the tags stored as payloads.

The example schema.xml shipped with Solr 4.0.0 includes a payloads field.
From its comments I have learnt that payloads require an encoder, whose type
can be float, integer or identity. However, none of these seems appropriate
to me for encoding PoS tags, which are strings (NNS, VLInf, CSUB, etc.). In
the indexed text they appear attached to the tokens: ugly|ADJ man|NNS.

With identity as the encoder type the indexer does not complain (with the
other two it does). However, I haven't been able to filter queries on the
information to the right of the payload delimiter (|), that is, on ADJ or
NNS in ugly|ADJ man|NNS.
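
To make the intended behaviour concrete, here is a minimal sketch in plain
Python (not Solr or Lucene code). It assumes that an identity-style encoder
simply keeps the characters after the delimiter as UTF-8 bytes; the function
names split_payloads and filter_by_tag are hypothetical and only illustrate
the kind of filtering I am after.

```python
from typing import List, Tuple

DELIMITER = "|"  # payload delimiter, as in ugly|ADJ


def split_payloads(text: str, delimiter: str = DELIMITER) -> List[Tuple[str, bytes]]:
    """Split whitespace-separated tokens into (term, payload bytes) pairs."""
    pairs = []
    for token in text.split():
        term, sep, tag = token.partition(delimiter)
        # Assumption: identity-style encoding keeps the tag as raw UTF-8 bytes.
        # Tokens without a delimiter get an empty payload.
        pairs.append((term, tag.encode("utf-8") if sep else b""))
    return pairs


def filter_by_tag(pairs: List[Tuple[str, bytes]], tag: str) -> List[str]:
    """Return the terms whose payload equals the given PoS tag."""
    wanted = tag.encode("utf-8")
    return [term for term, payload in pairs if payload == wanted]


pairs = split_payloads("ugly|ADJ man|NNS")
print(pairs)
print(filter_by_tag(pairs, "ADJ"))
```

What I am missing is the query-side equivalent of filter_by_tag in Solr.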

I would appreciate any help or pointers you can give me, and I will be
happy to provide more details if needed.

Best regards,
Martí

-- 
Martí Quixal
Computational Linguist & Educational Technologist
http://www.iqubo.org/quixal