Posted to user@nutch.apache.org by Ahmed Shiraz Memon <ah...@gmail.com> on 2007/10/28 17:24:38 UTC

Indexing and search of XML based information and Web Services

Hi,

I am an Apache Nutch newbie and find the way it is designed and implemented
very interesting. I would like to incorporate Nutch into our middleware,
which mainly caters to Grid Computing users. Grid Computing provides
seamless access to resources that are geographically dispersed. Our
middleware is web services based, so resources are accessed by invoking web
service operations. We have an immediate requirement to develop an
information service that gathers all the meta-data about resources and
provides this up-to-date information to brokers (a piece of software) via a
standard interface (i.e. API, WS).

Therefore, in the context of information services, is it possible to query
Apache Nutch via a query language (e.g. XQuery, which would suit our
scenario best)? All the information gathered from the different resource
sites is in XML format (and can conform to some xyz XML schema). If this is
not supported, can Nutch be extended to do it?

My other question is about restricting access to information. For example,
if user A searches for resources in the Grid, can the results be limited to
the information she is allowed to access?

Thanks in advance for any help,
Shiraz Memon