You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Andy Shimell <an...@devon.gov.uk> on 2011/10/24 22:36:31 UTC

DataImportHandler Nested Entities

Hi,

I want to use Solr 3.1 to index the content of a website. Rather than using a web crawler to fetch the content and load it into Solr I want to use the DIH to get the data from the Content Management Database that supports the website.

It would be possible to write SQL to obtain a complete set of metadata (for example DC.subject or DC.type) for each page or binary document stored in the database, using the JDBCDataSource. One of the values obtained would be the HTTP URL of the actual page or document, and I would need to obtain and index this content as well.

Could you tell me if its possible to nest entities that use a URLDataSource inside entities that use a JDBCDataSource ?

Andy