Posted to commits@cassandra.apache.org by "Piotr Kołaczkowski (JIRA)" <ji...@apache.org> on 2012/08/30 13:09:07 UTC

[jira] [Created] (CASSANDRA-4588) CQL COPY ... FROM command is slow

Piotr Kołaczkowski created CASSANDRA-4588:
---------------------------------------------

             Summary: CQL COPY ... FROM command is slow
                 Key: CASSANDRA-4588
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4588
             Project: Cassandra
          Issue Type: Bug
          Components: Tools
    Affects Versions: 1.1.4
         Environment: Ubuntu Linux 12.04, kernel 3.4.0
            Reporter: Piotr Kołaczkowski


1. Created a CSV file with 10,000,000 rows and two integer columns and saved it to an SSD; writing it took a few seconds, and the file is 184 MB.
2. Started a single local Cassandra node with fresh, empty data and commit log directories.
3. Created a keyspace with SimpleStrategy and RF=1.
4. Loaded the file with the COPY ... FROM command. After more than 15 minutes it is still loading.

top reports about 50% CPU usage for java (Cassandra) and 50% for python.
I/O is almost idle; iowait < 0.1%.
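
For anyone trying to reproduce step 1, here is a minimal sketch of a generator for such a file. It is not the reporter's script: the function names, the seed, and the value range are assumptions made for illustration, so the resulting file size will only roughly match the 184 MB mentioned above. The second helper just restates the timing from step 4 as an upper bound on average throughput.

```python
import csv
import random


def write_test_csv(path, rows, seed=0):
    """Write `rows` lines, each with two random integer columns, to `path`.

    The value range (0 .. 10**9 - 1) is an assumption; the report does not
    say which integers were used.
    """
    rng = random.Random(seed)
    with open(path, "w", newline="") as f:
        writer = csv.writer(f, lineterminator="\n")
        for _ in range(rows):
            writer.writerow((rng.randrange(10**9), rng.randrange(10**9)))


def max_avg_rows_per_sec(rows, elapsed_seconds):
    """Upper bound on the average load rate if the load of `rows` rows
    has NOT finished after `elapsed_seconds`."""
    return rows / elapsed_seconds
```

Calling write_test_csv("test.csv", 10_000_000) would produce a file of the shape described in step 1 (the file name is hypothetical). With the load still running after 15 minutes, max_avg_rows_per_sec(10_000_000, 15 * 60) gives about 11,111, so the average rate for the whole file is below roughly 11,000 rows per second.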



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (CASSANDRA-4588) CQL COPY ... FROM command is slow

Posted by "Brandon Williams (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-4588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brandon Williams resolved CASSANDRA-4588.
-----------------------------------------

    Resolution: Won't Fix

Wontfixing, since performance was never a goal, as outlined in CASSANDRA-4012.
                
