You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Berlin Brown <be...@gmail.com> on 2006/03/20 06:13:54 UTC

One more question, getSummary and HTML output

I am using the client libraries to query nutch, meaning I am not
running from Tomcat.  Is it possible to return the summary without the
HTML code?  Or do I need an HTML parser to do that.

# Return the HTML summary for this query
summary = bean.getSummary(details, query)
print summary

URL:  http://spacefinder.chicagoreader.com/movies/sts/showtimes.html
<b> ... </b>Fr-Sa also 10:00 am The Shaggy <b>Dog</b> Daily 1:35,
4:10, 6:50, 9:15; Fr<b> ... </b>
done.