You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Hari Sekhon (JIRA)" <ji...@apache.org> on 2014/09/01 19:05:20 UTC

[jira] [Created] (CASSANDRA-7860) csv2sstable - bulk load CSV data to SSTables similar to json2sstable

Hari Sekhon created CASSANDRA-7860:
--------------------------------------

             Summary: csv2sstable - bulk load CSV data to SSTables similar to json2sstable
                 Key: CASSANDRA-7860
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7860
             Project: Cassandra
          Issue Type: New Feature
         Environment: DataStax Community Edition 2.0.9
            Reporter: Hari Sekhon
            Priority: Blocker


Need a csv2sstable utility to bulk load billions of rows of CSV data - impractical to have to pre-convert to json before bulk loading to sstable.

CQL COPY really is too slow - a test of mere 4 million row 6GB CSV directly took 28 minutes... while it only takes 60 secs to cat all the data off hdfs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)