You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Tomi NA <he...@gmail.com> on 2006/10/28 12:35:58 UTC

returning a description of a returned document

Is there a way to have nutch return some hit context (a la google) to
better identify the hit?
For example, if I search for "nutch", a link pointing to
"http://lucene.apache.org/nutch/" would be followed by the following
context:
"This is the first *Nutch* release as an Apache Lucene sub-project.
... *Nutch* is a two-year-old open source project, previously hosted
at Sourceforge and ..."

t.n.a.

Re: returning a description of a returned document

Posted by Tomi NA <he...@gmail.com>.
2006/10/29, Cristina Belderrain <cr...@gmail.com>:
> Hi Tomi,
>
> please take a look at the following tutorial:
>
>    http://today.java.net/pub/a/today/2006/02/16/introduction-to-nutch-2.html
>
> Apparently, Nutch's search application already shows hit summaries...
> Anyway, you can always retrieve each summary programatically using a
> NutchBean instance: please see the sample code towards the end of the
> tutorial.

Silly, I should have looked at the nutch UI .jsps right away: the
thing is, I've been working exclusively on intranet shared folder
searches for some time now and can't explain it (yet), but it seems
that none of the indexed documents have a summary. I only asked the
question in the first place because I've never really noticed a single
summary in the search hits.

I'll look into it and see what kind of explanation i come up with.

Thanks, Cristina.

t.n.a.

Re: returning a description of a returned document

Posted by Cristina Belderrain <cr...@gmail.com>.
Hi Tomi,

please take a look at the following tutorial:

   http://today.java.net/pub/a/today/2006/02/16/introduction-to-nutch-2.html

Apparently, Nutch's search application already shows hit summaries...
Anyway, you can always retrieve each summary programatically using a
NutchBean instance: please see the sample code towards the end of the
tutorial.

Hope this helps,

Cristina