Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2009/03/17 17:03:53 UTC

[Solr Wiki] Trivial Update of "DataImportHandler" by FergusMcMenemie

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by FergusMcMenemie:
http://wiki.apache.org/solr/DataImportHandler

The comment on the change is:
Clarifying use of HttpDataSource and FileDataSource

------------------------------------------------------------------------------
      <document>
  <dataConfig>
  }}}
- Do not miss the `rootEntity` attribute. The implicit fields generated by the processor are `fileAbsolutePath,fileSize,fileLastModified,fileName`.
+ Do not miss the `rootEntity` attribute. The implicit fields generated by the FileListEntityProcessor are `fileAbsolutePath,fileSize,fileLastModified,fileName`, and these are available for use within the entity X as shown above. Note that FileListEntityProcessor returns a list of pathnames, and the subsequent entity must use the FileDataSource to fetch each file's content.
  
  === CachedSqlEntityProcessor ===
  [[Anchor(cached)]]
@@ -800, +800 @@

  
  It is designed to iterate rows in DB one by one. A row is represented as a Map.
  === HttpDataSource ===
- This is used by X!PathEntityProcessor . See the documentation [#httpds here] . The signature is as follows
+ This is used by X!PathEntityProcessor to fetch content over HTTP. See the documentation [#httpds here] . The signature is as follows
  {{{
  public class HttpDataSource extends DataSource<Reader>
  }}}
  === FileDataSource ===
- This can be used like an !HttpDataSource . The signature is as follows
+ This can be used like an !HttpDataSource but is used to fetch content from files on disk. The signature is as follows
  {{{
  public class FileDataSource extends DataSource<Reader>
  }}}
  
  The attributes are:
-  * '''`basePath`''': (optional) The base path relative to which the value is evaluated
+  * '''`basePath`''': (optional) The base path relative to which the value is evaluated if it is not absolute
   * '''`encoding`''': (optional) If the files are to be read in an encoding that is not the same as the platform encoding
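
To make the division of labour described in this change concrete, here is a minimal data-config sketch, not taken from the wiki page itself. It assumes a directory of XML files; the paths, entity names, the `forEach` expression, and the field mapping are all hypothetical placeholders. The outer entity only lists files, and the inner entity reads each one through the FileDataSource.

```xml
<dataConfig>
  <!-- FileDataSource reads file content from disk.
       basePath and encoding are the optional attributes listed above;
       the values here are illustrative assumptions. -->
  <dataSource type="FileDataSource" basePath="/data/import/" encoding="UTF-8"/>
  <document>
    <!-- Outer entity: FileListEntityProcessor only enumerates pathnames,
         so it needs no data source (dataSource="null") and is not the
         root entity (rootEntity="false"). -->
    <entity name="f" processor="FileListEntityProcessor"
            baseDir="/data/import/xml" fileName=".*\.xml"
            recursive="true" rootEntity="false" dataSource="null">
      <!-- Inner entity: fetches each listed file via the FileDataSource,
           using the implicit field fileAbsolutePath from the outer entity. -->
      <entity name="x" processor="XPathEntityProcessor"
              url="${f.fileAbsolutePath}" forEach="/records/record">
        <!-- Hypothetical field mapping -->
        <field column="title" xpath="/records/record/title"/>
      </entity>
    </entity>
  </document>
</dataConfig>
```

Because `${f.fileAbsolutePath}` is already an absolute path, `basePath` would not come into play in this sketch; it would only matter if the inner entity's `url` were a relative path.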
  
  === FieldReaderDataSource ===