You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by arno13 <ar...@healthonnet.org> on 2009/04/30 12:26:45 UTC

fragmenter regexp

Hi,

I don't succeed to use the fragmenter regexp functionality in solr

I'm using solr 1.3 and I defined my fragmenter like this in the
sorlconfigxml:

   <!-- A regular-expression-based fragmenter (f.i., for sentence
extraction) -->
   <fragmenter name="myregex"
class="org.apache.solr.highlight.RegexFragmenter">
    <lst name="defaults">
 
      <!-- a basic sentence pattern -->
      <str name="hl.regex.pattern">[-\w ,/\n\"']{100,200}</str>
    </lst>
   </fragmenter>

my query is the following:
/solr/select?indent=on&version=2.2&q=fever&rows=100&start=0&wt=standard&qt=standard&fl=id,title,url,score,inurl&qf=title^2%20content^1%20site^2%20inurl^2&pf=title^5%20content^2%20site^1%20inurl^1&ps=100&hl=on&hl.fl=content&hl.fragmenter=myregex

However I still have fragments with final punctuation inside, such as:

"and Prevention. . Yellow <em>fever</em> virus, a flavivirus, is transmitted
to humans through"
". Infectious Disease Book . Bacterial Infections. . <em>Fever</em> of
Unknown"
"Rheumatic <em>fever</em>. ARTICLE SECTIONS . Definition. Rheumatic
<em>fever</em>"

What I'm doing wrong? 

Thanks
-- 
View this message in context: http://www.nabble.com/fragmenter-regexp-tp23313514p23313514.html
Sent from the Solr - User mailing list archive at Nabble.com.