You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2008/02/06 12:35:57 UTC

[Solr Wiki] Update of "DataImportHandler" by ShalinMangar

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by ShalinMangar:
http://wiki.apache.org/solr/DataImportHandler

New page:
= Overview =

== Motivation ==
Most applications store data in relational databases and searching over such data is a common use-case. However, there is no standard way to import this data into SOLR index requiring custom tools external to SOLR.

== Goals ==
 * Read data residing in relational databases
 * Build SOLR documents by aggregating data from multiple columns and tables according to configuration
 * Update SOLR with such documents
 * Provide ability to do full imports according to configuration
 * Detect inserts/update deltas (changes) and do delta imports (we assume a last-modified timestamp column for this to work)
 * Schedule full imports and delta imports

= Design Overview =
As the name suggests, this is implemented as a SolrRequestHandler. The configuration is provided in two places:
 * solrconfig.xml (DataSource information is read from here e.g. JDBC Driver, JDBC URL, Username, Password etc.)
 * data-config.xml (DB Table/column to SOLR document mapping comes here)

= Usage =
TODO

----
CategorySolrRequestHandler