You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@cassandra.apache.org by "Jeff Jirsa (JIRA)" <ji...@apache.org> on 2016/11/14 04:54:59 UTC

[jira] [Commented] (CASSANDRA-12907) Different data directories for SSDs and HDDs at configuration level

    [ https://issues.apache.org/jira/browse/CASSANDRA-12907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15662767#comment-15662767 ] 

Jeff Jirsa commented on CASSANDRA-12907:
----------------------------------------

There have been similar tickets in the past ( e.g. CASSANDRA-8460 which was DTCS specific, but added a second data file directory config option for 'archived' (spinning) data ).

I think a better option may be some sort of tagged storage - make yaml support a map, where the default is raw data files, and then add a data directory tag to the schema per keyspace or per table, so you can explicitly map keyspaces/tables to named disks for performance or resource isolation as needed.







> Different data directories for SSDs and HDDs at configuration level
> -------------------------------------------------------------------
>
>                 Key: CASSANDRA-12907
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-12907
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Natale Galioto
>              Labels: performance
>
> Currently, users can speed up some CFs by symlinking its data directory to fast media such as SSDs. In my opinion, instead, configuration file should allow two different sets of directory: one dedicated to spindles, one dedicated to SSDs. 
> This would allow a "once and for all mixed SSD & HDD configuration", instead of continuously symlinking the "right" directory each time a CF is created (due to the name mangling of the CF directories).
> And this in turn would allow a priori knowledge on disk structures, and would allow to place indexes of all sort (lookup, partition, etc... everything that is needed to "just" locate data) on fast SSDs, speeding up ALL the CFs instead of only one, while the HDDs could be used just for data retrieval and sequential reads. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)