Posted to commits@manifoldcf.apache.org by kw...@apache.org on 2014/06/10 13:40:28 UTC
svn commit: r1601608 - /manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/concepts.xml
Author: kwright
Date: Tue Jun 10 11:40:28 2014
New Revision: 1601608
URL: http://svn.apache.org/r1601608
Log:
Update concepts document. Part of CONNECTORS-959.
Modified:
manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/concepts.xml
Modified: manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/concepts.xml
URL: http://svn.apache.org/viewvc/manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/concepts.xml?rev=1601608&r1=1601607&r2=1601608&view=diff
==============================================================================
--- manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/concepts.xml (original)
+++ manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/concepts.xml Tue Jun 10 11:40:28 2014
@@ -100,12 +100,13 @@
<section>
<title>Connectors</title>
<p></p>
- <p>ManifoldCF defines four different kinds of connectors. These are:</p>
+ <p>ManifoldCF defines five different kinds of connectors. These are:</p>
<p></p>
<ul>
<li>User mapping connectors</li>
<li>Authority connectors</li>
<li>Repository connectors</li>
+ <li>Transformation connectors</li>
<li>Output connectors</li>
</ul>
<p></p>
@@ -121,6 +122,7 @@
<tr><td>User mapping connector</td><td>Maps a user name to another (equivalent) user name, typically by means of a regular expression mechanism, or by repository access</td></tr>
<tr><td>Authority connector</td><td>Furnishes a standard way of mapping a user name to access tokens that are meaningful for a given type of repository</td></tr>
<tr><td>Repository connector</td><td>Fetches documents from a specific kind of repository, such as SharePoint or off the web</td></tr>
+ <tr><td>Transformation connector</td><td>Modifies documents or their metadata after they are fetched by a repository connector and before they are sent to the index by an output connector</td></tr>
<tr><td>Output connector</td><td>Pushes document ingestion requests and deletion requests to a specific kind of back end search engine or other entity, such as Lucene</td></tr>
</table>
<p></p>
@@ -135,9 +137,11 @@
LiveLink connection by specifying how to talk to the LiveLink server. You would <strong>not</strong> include information about which documents to select in such a
configuration.</p>
<p></p>
- <p>There is one difference between how you define a <em>repository connection</em>, vs. how you would define an <em>authority connection</em> or <em>output
- connection</em> or <em>mapping connection</em>. The difference is that you must specify a governing authority connection for your repository connection. This is
- because <strong>all</strong> documents ingested by ManifoldCF need to include appropriate access tokens, and those access tokens are specific to the governing authority.</p>
+ <p>There is one difference between how you define a <em>repository connection</em> or <em>authority connection</em>, vs. how you would define a <em>transformation connection</em> or <em>output
+ connection</em> or <em>mapping connection</em>. The difference is that you must specify a governing authority group for your repository connection, and an owning
+ authority group for your authority connection. This is
+ because <strong>all</strong> documents ingested by ManifoldCF need to include appropriate access tokens, and those access tokens are specific to
+ the governing authority group.</p>
<p></p>
<p>Another difference in how you define an <em>authority connection</em> or <em>mapping connection</em>, vs. other connections, is that you can specify a prerequisite
<em>mapping connection</em> that must occur beforehand. This means you can have multiple user mappings that occur in a defined sequence, before the authority is
@@ -152,9 +156,12 @@
<p></p>
<ul>
<li>A verbal description</li>
- <li>A repository connection (and thus implicitly an authority connection as well)</li>
+ <li>A repository connection (and thus implicitly an authority group as well)</li>
+ <li>Zero or more transformation connections</li>
<li>An output connection</li>
<li>A repository-connection-specific description of "what" documents and metadata the job applies to</li>
+ <li>Zero or more transformation-connection-specific descriptions of "how" documents and metadata should be manipulated before indexing</li>
+ <li>An output-connection-specific description of how documents should be indexed</li>
<li>A model for crawling: either "run to completion", or "run continuously"</li>
<li>A schedule for when the job will run: either within specified time windows, or on demand</li>
</ul>