Posted to user@nutch.apache.org by Brent Goran <br...@strategoit.com> on 2005/09/03 08:20:21 UTC

How to search by "links-to"?

Is there a query syntax for Nutch to search all pages which link to
another URL?

e.g.:

link:http://cnn.com
linkto:http://cnn.com
a:http://cnn.com

Any of the above would be a logical syntax for such a feature, but none
work. Did I miss it?




Re: How to search by "links-to"?

Posted by Matthias Jaekle <ja...@eventax.de>.
Hi,
I think these values are not stored in the segments at the moment,
so you cannot search for them via the search interface.
Information about the links is stored in the db.
We once changed webdbreader/readdb to output this information in CSV
format: http://issues.apache.org/jira/browse/NUTCH-75
Maybe you could use this.
Matthias
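
If the link information is exported to CSV as suggested above, a small script could answer the "links-to" question outside of the search interface. This is only a rough sketch: the column layout (source URL, target URL, anchor text) and the function name pages_linking_to are assumptions made for illustration, not the actual output format of the NUTCH-75 patch, so the column indexes would need adjusting to the real dump.

```python
import csv
import io


def pages_linking_to(csv_text, target_url):
    """Return distinct source URLs whose link target equals target_url.

    Assumed (hypothetical) column layout: from_url, to_url, anchor_text.
    A trailing slash on either URL is ignored when comparing.
    """
    wanted = target_url.rstrip("/")
    sources = []
    seen = set()
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) < 2:
            continue  # skip blank or malformed lines
        from_url, to_url = row[0].strip(), row[1].strip()
        if to_url.rstrip("/") == wanted and from_url not in seen:
            seen.add(from_url)
            sources.append(from_url)
    return sources


# Made-up sample data in the assumed layout.
sample = """\
http://example.org/news,http://cnn.com,CNN
http://example.org/about,http://example.org/,home
http://example.net/links,http://cnn.com/,news
http://example.org/news,http://cnn.com,CNN again
"""

print(pages_linking_to(sample, "http://cnn.com"))
```

For a large db dump it would be better to read the file line by line instead of holding the whole text in memory; the string input here just keeps the sketch self-contained.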


Brent Goran wrote:

> Is there a query syntax for Nutch to search all pages which link to
> another URL?
> 
> e.g.:
> 
> link:http://cnn.com
> linkto:http://cnn.com
> a:http://cnn.com
> 
> Any of the above would be a logical syntax for such a feature, but none
> work. Did I miss it?
> 
> 
> 
> 

-- 
http://www.eventax.com - eventax GmbH
http://www.umkreisfinder.de - The search engine for local information and events