You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2009/08/15 09:25:17 UTC

[Solr Wiki] Update of "CollectionDistribution" by HossMan

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by HossMan:
http://wiki.apache.org/solr/CollectionDistribution

The comment on the change is:
link to new java replication, and long overdue cleanup of TODO items

------------------------------------------------------------------------------
- /!\ :TODO: /!\ update final packaging strategy for scripts.
+ This document describes the ssh/rsync based replication available since ["Solr1.1"] '''This mechanism only works on systems that support removing open hard links'''.  For information on the Java implementation of index replication that works over HTTP and was introduced in ["Solr1.4"] please consult SolrReplication.
+ 
+ [[TableOfContents]]
+ 
+ == Introduction ==
  
  Solr distribution is similar in concept to database replication.  All collection changes come to one master Solr server. All production queries are done against query slaves. Query slaves receive all their collection changes indirectly &#151; as new versions of a collection which they pull from the master.
  These collection downloads are polled for on a cron'd basis.
@@ -49, +53 @@

  == Solr Distribution Scripts ==
  
     * For the Solr distribution scripts, the name of the index directory is defined either by the environment variable '''data_dir''' in the configuration file '''solr/conf/scripts.conf''' or the command line argument '''-d'''.  It should match the value used by the Solr server which is defined in '''solr/conf/solrconfig.xml'''.
-    * All Solr collection distribution scripts are bundled within a Solr release and reside in the directory '''solr/bin''' of each instance of Solr. [[BR]] /!\ :TODO: /!\ revise pending final packaging strategy
+    * All Solr collection distribution scripts are bundled within a Solr release and reside in the directory '''solr/src/scripts'''.  It is suggested that the be installed in a '''solr/bin/''' directory. 
     * Collection distribution scripts create and prepare for distribution a snapshot of a search collection after each '''commit''' and '''optimize''' request if the ''postCommit'' and ''postOptimize'' event listener is configured in '''solr/conf/solrconfig.xml''' to execute '''snapshooter'''.
     * The '''snapshooter''' script creates a directory ''snapshot.&#60;ts&#62;'', where &#60;ts&#62; is a timestamp in the format, yyyymmddHHMMSS.  It contains hard links to the data files.
     * Snapshots are distributed from the master server when the slaves pull them, "smartcopying" the snapshot directory that contains the hard links to the most recent collection data files.