You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by arno13 <ar...@healthonnet.org> on 2009/04/30 12:26:45 UTC
fragmenter regexp
Hi,
I don't succeed to use the fragmenter regexp functionality in solr
I'm using solr 1.3 and I defined my fragmenter like this in the
sorlconfigxml:
<!-- A regular-expression-based fragmenter (f.i., for sentence
extraction) -->
<fragmenter name="myregex"
class="org.apache.solr.highlight.RegexFragmenter">
<lst name="defaults">
<!-- a basic sentence pattern -->
<str name="hl.regex.pattern">[-\w ,/\n\"']{100,200}</str>
</lst>
</fragmenter>
my query is the following:
/solr/select?indent=on&version=2.2&q=fever&rows=100&start=0&wt=standard&qt=standard&fl=id,title,url,score,inurl&qf=title^2%20content^1%20site^2%20inurl^2&pf=title^5%20content^2%20site^1%20inurl^1&ps=100&hl=on&hl.fl=content&hl.fragmenter=myregex
However I still have fragments with final punctuation inside, such as:
"and Prevention. . Yellow <em>fever</em> virus, a flavivirus, is transmitted
to humans through"
". Infectious Disease Book . Bacterial Infections. . <em>Fever</em> of
Unknown"
"Rheumatic <em>fever</em>. ARTICLE SECTIONS . Definition. Rheumatic
<em>fever</em>"
What I'm doing wrong?
Thanks
--
View this message in context: http://www.nabble.com/fragmenter-regexp-tp23313514p23313514.html
Sent from the Solr - User mailing list archive at Nabble.com.