Posted to commits@cassandra.apache.org by "Nikhil Patel (JIRA)" <ji...@apache.org> on 2015/06/04 07:51:41 UTC
[jira] [Commented] (CASSANDRA-4175) Reduce memory, disk space, and cpu usage with a column name/id map
[ https://issues.apache.org/jira/browse/CASSANDRA-4175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14572204#comment-14572204 ]
Nikhil Patel commented on CASSANDRA-4175:
-----------------------------------------
I am curious what the final decision on the column name/id mapping is. Has this feature been implemented, or is there a plan to do so?
> Reduce memory, disk space, and cpu usage with a column name/id map
> ------------------------------------------------------------------
>
> Key: CASSANDRA-4175
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4175
> Project: Cassandra
> Issue Type: Improvement
> Reporter: Jonathan Ellis
> Assignee: Jason Brown
> Labels: performance
> Fix For: 3.x
>
>
> We spend a lot of memory on column names, both transiently (during reads) and more permanently (in the row cache). Compression mitigates this on disk but not on the heap.
> The overhead is significant for typical small column values, e.g., ints.
> Even though we intern once we get to the memtable, this affects writes too via very high allocation rates in the young generation, hence more GC activity.
> Now that CQL3 provides us some guarantees that column names must be defined before they are inserted, we could create a map of (say) 32-bit int column ids to names, and use the ids internally right up until we return a resultset to the client.
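
For illustration only, the name/id map described in the issue could be sketched in Java roughly as follows. This is a minimal sketch under stated assumptions, not Cassandra's actual code: the class ColumnIdMap and its methods define, idOf, and nameOf are hypothetical names, ids are assumed to be assigned sequentially when a column is defined, and persistence or cluster-wide agreement on ids is ignored.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of a column name <-> 32-bit id registry.
// Not taken from the Cassandra code base.
class ColumnIdMap {
    private final Map<String, Integer> idsByName = new HashMap<>();
    private final List<String> namesById = new ArrayList<>();

    // Registers a column name at schema-definition time and returns its id.
    // Defining the same name twice returns the id assigned the first time.
    public synchronized int define(String name) {
        Integer existing = idsByName.get(name);
        if (existing != null) {
            return existing;
        }
        int id = namesById.size(); // assumption: sequential ids starting at 0
        idsByName.put(name, id);
        namesById.add(name);
        return id;
    }

    // Translates a name to its id on the write path.
    // Names must be defined first, mirroring the CQL3 guarantee.
    public synchronized int idOf(String name) {
        Integer id = idsByName.get(name);
        if (id == null) {
            throw new IllegalArgumentException("Undefined column: " + name);
        }
        return id;
    }

    // Translates an id back to its name just before returning results.
    public synchronized String nameOf(int id) {
        if (id < 0 || id >= namesById.size()) {
            throw new IllegalArgumentException("Unknown column id: " + id);
        }
        return namesById.get(id);
    }
}
```

In this sketch, internal structures would carry the 4-byte id in place of the full column name, and nameOf would be called only when building the resultset for the client.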
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)