You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2009/03/17 17:03:53 UTC
[Solr Wiki] Trivial Update of "DataImportHandler" by FergusMcMenemie
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.
The following page has been changed by FergusMcMenemie:
http://wiki.apache.org/solr/DataImportHandler
The comment on the change is:
Clarifing use of HttpDataSource and FileDataSource
------------------------------------------------------------------------------
<document>
<dataConfig>
}}}
- Do not miss the `rootEntity` attribute. The implicit fields generated by the processor are `fileAbsolutePath,fileSize,fileLastModified,fileName`.
+ Do not miss the `rootEntity` attribute. The implicit fields generated by the FileListEntityProcessor are `fileAbsolutePath,fileSize,fileLastModified,fileName` and these are available for use within the entity X as shown above. It should be noted that FileListEntityProcessor returns a list of pathnames and that the subsequent entity must use the FileDataSource to fetch the files content.
=== CachedSqlEntityProcessor ===
[[Anchor(cached)]]
@@ -800, +800 @@
It is designed to iterate rows in DB one by one. A row is represented as a Map.
=== HttpDataSource ===
- This is used by X!PathEntityProcessor . See the documentation [#httpds here] . The signature is as follows
+ This is used by X!PathEntityProcessor to fetch content from HttpDataSources. See the documentation [#httpds here] . The signature is as follows
{{{
public class HttpDataSource extends DataSource<Reader>
}}}
=== FileDataSource ===
- This can be used like an !HttpDataSource . The signature is as follows
+ This can be used like an !HttpDataSource but used to fetch content from files on disk. The signature is as follows
{{{
public class FileDataSource extends DataSource<Reader>
}}}
The attributes are:
- * '''`basePath`''': (optional) The base path relative to which the value is evaluated
+ * '''`basePath`''': (optional) The base path relative to which the value is evaluated if it is not absolute
* '''`encoding`''': (optional) If the files are to be read in an encoding that is not same as the platform encoding
=== FieldReaderDataSource ===