Posted to commits@gora.apache.org by le...@apache.org on 2014/07/02 08:21:53 UTC

svn commit: r1607251 - /gora/site/trunk/content/current/gora-cassandra.md

Author: lewismc
Date: Wed Jul  2 06:21:53 2014
New Revision: 1607251

URL: http://svn.apache.org/r1607251
Log:
https://issues.apache.org/jira/browse/GORA-328

Modified:
    gora/site/trunk/content/current/gora-cassandra.md

Modified: gora/site/trunk/content/current/gora-cassandra.md
URL: http://svn.apache.org/viewvc/gora/site/trunk/content/current/gora-cassandra.md?rev=1607251&r1=1607250&r2=1607251&view=diff
==============================================================================
--- gora/site/trunk/content/current/gora-cassandra.md (original)
+++ gora/site/trunk/content/current/gora-cassandra.md Wed Jul  2 06:21:53 2014
@@ -5,25 +5,53 @@ This is the main documentation for the g
 enables [Apache Cassandra](http://cassandra.apache.org) backend support for Gora. 
 
 ##gora.properties 
-* <code>gora.datastore.default=org.apache.gora.cassandra.store.CassandraStore</code> - Implementation of the storage class 
-* <code>gora.cassandra.mapping.file=/path/to/gora-cassandra-mapping.xml</code> - The XML mapping file to be used 
-* <code>gora.cassandra.servers=localhost:9160</code> - This value should specify the host:port for a running Cassandra server or node. In this case the server happens to be running on localhost at port 9160 which is the default Cassandra server configuration.
+<table class="table">
+  <thead>
+   <tr>
+    <th align="left">Property Key</th>
+    <th align="left">Property Value</th> 
+    <th align="left">Required</th>
+    <th align="left">Description</th>
+   </tr>
+  </thead>
+  <tbody>
+   <tr>
+    <td>gora.datastore.default=</td>
+    <td>org.apache.gora.cassandra.store.CassandraStore</td>
+    <td>Yes</td>
+    <td>Implementation of the persistent Java storage class</td>
+   </tr>
+   <tr>
+    <td>gora.cassandra.mapping.file=</td>
+    <td>/path/to/gora-cassandra-mapping.xml</td>
+    <td>No</td>
+    <td>The XML mapping file to be used. If no value is set, this defaults to <code>gora-cassandra-mapping.xml</code></td>
+   </tr>
+   <tr>
+    <td>gora.cassandra.servers=</td>
+    <td>localhost:9160</td>
+    <td>Yes</td>
+    <td>This value should specify the host:port for a running Cassandra server or node. In this case the server is running on localhost at port 9160, which is the default Cassandra server configuration. The <b>host</b> must match the one specified in <code>gora-cassandra-mapping.xml</code></td>
+   </tr>
+  </tbody>
+</table>
 
 ##Gora Cassandra mappings 
 Say we wished to map some Employee data and store it into the CassandraStore.
 
     <gora-otd>
-      <keyspace name="Employee" host="localhost" cluster="Gora Cassandra Test Cluster">
-        <family name="p"/>
-        <family name="f"/>
+      <keyspace name="Employee" host="localhost" placement_strategy="org.apache.cassandra.locator.SimpleStrategy"
+      replication_factor="1" cluster="Gora Cassandra Test Cluster">
+        <family name="p" gc_grace_seconds="5"/>
+        <family name="f" gc_grace_seconds="5"/>
         <family name="sc" type="super" />
       </keyspace>
 
       <class name="org.apache.gora.examples.generated.Employee" keyClass="java.lang.String" keyspace="Employee">
-        <field name="name"  family="p" qualifier="info:nm"/>
-        <field name="dateOfBirth"  family="p" qualifier="info:db"/>
-        <field name="ssn"  family="p" qualifier="info:sn"/>
-        <field name="salary"  family="p" qualifier="info:sl"/>
+        <field name="name"  family="p" qualifier="info:nm" ttl="10"/>
+        <field name="dateOfBirth"  family="p" qualifier="info:db" ttl="10"/>
+        <field name="ssn"  family="p" qualifier="info:sn" ttl="10"/>
+        <field name="salary"  family="p" qualifier="info:sl" ttl="10"/>
       </class>
     </gora-otd>
 
@@ -34,11 +62,32 @@ The <b>keyspace</b> element; where we sp
 
 1. a parameter containing the Cassandra keyspace schema name e.g. <b>Employee</b>, 
 
-2. a parameter containing the host e.g. <b>localhost</b>, 
+2. a parameter containing the host e.g. <b>localhost</b>. The value of the <b>host</b> attribute of the keyspace tag should exactly match the value in
+   the gora.properties file. In practice, if you specify a port number, you should
+   specify it everywhere, regardless of whether it is the default port.
+   Otherwise, at runtime Gora will try to connect to localhost. For more information see [GORA-269](https://issues.apache.org/jira/browse/GORA-269).
 
 3. a parameter containing the Cassandra cluster name e.g. <b>Gora Cassandra Test Cluster</b>,
 
-4. A nested element containing the name and type of column families we wish to create within Cassandra. In this case we create three columns;  <b>p</b>, <b>f</b> and <b>sc</b> the last of which contains an optional <b>type</b> attribute which further defines this as a super column. 
+4. a parameter containing a <b>placement_strategy</b>: the value of 'placement_strategy' should be a fully qualified class name that is known to
+   the Cassandra cluster, not to the application or Gora. As of this writing, the classes that ship
+   with Cassandra are
+   <code>org.apache.cassandra.locator.SimpleStrategy</code> and
+   <code>org.apache.cassandra.locator.NetworkTopologyStrategy</code>.
+   gora-cassandra uses SimpleStrategy by default if no value for this attribute is specified. Note
+   that the placement_strategy attribute of the keyspace tag
+   applies only if Gora creates the Cassandra keyspace. More about placement strategies can be found
+   [here](http://www.datastax.com/documentation/cassandra/1.2/cassandra/architecture/architectureDataDistributeReplication_c.html).
+
+5. a parameter containing a <b>replication_factor</b> attribute with an integer value. Again, the replication_factor value associated with the keyspace tag
+   applies only if Gora creates the keyspace and has no effect when used against
+   an existing keyspace. The default value for 'replication_factor' is '1'. <b>N.B.</b> In Cassandra this property is required if the placement_strategy
+   class is SimpleStrategy; otherwise it is not used. This value is the number of replicas of the data that are kept across the nodes of the cluster.
+
+6. a child element <b>family</b> containing the <b>name</b>, <b>type</b> and <b>gc_grace_seconds</b> parameters for the column families we wish to create within Cassandra. In this case we create three column families: <b>p</b>, <b>f</b> and <b>sc</b>, the last of which contains an optional <b>type</b> attribute which further defines it as a super column family.
+   Additionally, column families <b>p</b> and <b>f</b> assign a value of 5 to <b>gc_grace_seconds</b>. In Gora we define the default value of 'gc_grace_seconds' as '0', which is ONLY VIABLE FOR A SINGLE NODE
+   CLUSTER. You should update this value according to your [cluster configuration](https://wiki.apache.org/cassandra/StorageConfiguration).
+   The gc_grace_seconds value sets how long Cassandra keeps tombstones (deletion markers) before they are garbage collected. More information can be found [here](http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_about_deletes_c.html)
 
 The <b>class</b> element specifying persistent fields which values should map to. This element contains; 
 
@@ -55,3 +104,9 @@ The <b>class</b> element specifying pers
     a parameter containing the column <b>family</b> to which the field belongs e.g. (all p in this case), 
 
     an optional parameter <b>qualifier</b>, which enables more granular control over the data to be persisted into Cassandra.
+
+    an optional parameter <b>ttl</b> (time to live): the value of the 'ttl' attribute should almost always
+   be zero unless you want Cassandra to create tombstones and delete portions of your
+   data once this period expires. Any positive value is interpreted as the number
+   of seconds after which the value for that field will disappear. The default value of ttl
+   is '0'.
\ No newline at end of file
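
The host-matching advice in the committed text can be illustrated with a small mapping fragment. This is only a sketch, assuming the advice is read literally: gora.properties is assumed to set `gora.cassandra.servers=localhost:9160`, and the `host` attribute repeats that same host:port string. The `host="localhost:9160"` value is illustrative and does not appear in the commit; the other attributes are copied from the committed example.

```xml
<!-- Sketch only: gora.properties is assumed to contain
     gora.cassandra.servers=localhost:9160 -->
<gora-otd>
  <!-- host repeats the host:port string from gora.properties (illustrative value) -->
  <keyspace name="Employee" host="localhost:9160"
            placement_strategy="org.apache.cassandra.locator.SimpleStrategy"
            replication_factor="1" cluster="Gora Cassandra Test Cluster">
    <!-- gc_grace_seconds as in the committed example; tune it for multi-node clusters -->
    <family name="p" gc_grace_seconds="5"/>
  </keyspace>
</gora-otd>
```

If the port is omitted in one place, it should be omitted in the other as well, so that both files carry the same string.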