You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@manifoldcf.apache.org by kw...@apache.org on 2010/07/26 01:15:01 UTC
svn commit: r979126 - /incubator/lcf/trunk/modules/framework/pull-agent/org/apache/lcf/crawler/interfaces/IRepositoryConnector.java

Author: kwright
Date: Sun Jul 25 23:15:00 2010
New Revision: 979126

URL: http://svn.apache.org/viewvc?rev=979126&view=rev
Log:
Update comment, since it was somewhat out of date and was confusing people.

Modified:
    incubator/lcf/trunk/modules/framework/pull-agent/org/apache/lcf/crawler/interfaces/IRepositoryConnector.java

Modified: incubator/lcf/trunk/modules/framework/pull-agent/org/apache/lcf/crawler/interfaces/IRepositoryConnector.java
URL: http://svn.apache.org/viewvc/incubator/lcf/trunk/modules/framework/pull-agent/org/apache/lcf/crawler/interfaces/IRepositoryConnector.java?rev=979126&r1=979125&r2=979126&view=diff
==============================================================================
--- incubator/lcf/trunk/modules/framework/pull-agent/org/apache/lcf/crawler/interfaces/IRepositoryConnector.java (original)
+++ incubator/lcf/trunk/modules/framework/pull-agent/org/apache/lcf/crawler/interfaces/IRepositoryConnector.java Sun Jul 25 23:15:00 2010
@@ -48,12 +48,16 @@ import java.util.*;
 * It therefore establishes a space of document identifiers.  Each connector will only ever be
 * asked to deal with identifiers that have in some way originated from the connector.
 *
-* Documents are fetched in three stages.  First, the getDocuments() method is called in the connector
-* implementation.  This returns a set of document identifiers.  The document identifiers are used to
-* obtain the current document version strings in the second stage, using the getDocumentVersions() method.
-* The last stage is processDocuments(), which queues up any additional documents needed, and also ingests.
-* This method will not be called if the document version seems to indicate that no document change took
-* place.
+* Documents are fetched by LCF in three stages.  First, the addSeedDocuments() method is called in the connector
+* implementation.  This method is meant to add a set of document identifiers to the queue.  When LCF is ready
+* to process a document, the document identifier is used to obtain a current document version string, using the
+* getDocumentVersions() method (the second stage).  This version string is used to decide whether or not the
+* third stage need be called for the document or not.  The third stage is responsible for sending document content
+* to the output, and for extracting any references to additional documents, and consists of the processDocuments() method.
+*
+* All of these methods interact with LCF by means of an "activity" interface.  For example, an IVersionActivity object
+* is passed to the getDocumentVersions() method, and that object contains methods that are necessary for getDocumentVersions()
+* to do its job.  A similar architecture is used throughout the connector framework.
 */
 public interface IRepositoryConnector
 {