You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Nicolas Favre-Felix (JIRA)" <ji...@apache.org> on 2013/11/27 13:27:36 UTC
[jira] [Created] (CASSANDRA-6412) Custom creation and merge
functions for user-defined column types
Nicolas Favre-Felix created CASSANDRA-6412:
----------------------------------------------
Summary: Custom creation and merge functions for user-defined column types
Key: CASSANDRA-6412
URL: https://issues.apache.org/jira/browse/CASSANDRA-6412
Project: Cassandra
Issue Type: New Feature
Components: Core
Reporter: Nicolas Favre-Felix
This is a proposal for a new feature, mapping custom types to Cassandra columns.
These types would provide a creation function and a merge function, to be implemented in Java by the user.
This feature relates to the concept of CRDTs; the proposal is to replicate "operations" on these types during write, to apply these operations internally during merge (Column.reconcile), and to also merge their values on read.
The following operations are made possible without reading back any data:
* MIN or MAX(value) for a column
* First value for a column
* Count Distinct
* HyperLogLog
* Count-Min
And any composition of these too, e.g. a Candlestick type includes first, last, min, and max.
The merge operations exposed by these types need to be commutative; this is the case for many functions used in analytics.
This feature is incomplete without some integration with CASSANDRA-4775 (Counters 2.0) which provides a Read-Modify-Write implementation for distributed counters. Integrating custom creation and merge functions with new counters would let users implement complex CRDTs in Cassandra, including:
* Averages & related (sum of squares, standard deviation)
* Graphs
* Sets
* Custom registers (even with vector clocks)
I have a working prototype with implementations for min, max, and Candlestick at https://github.com/acunu/cassandra/tree/crdts - I'd appreciate any feedback on the design and interfaces.
--
This message was sent by Atlassian JIRA
(v6.1#6144)