Posted to commits@cassandra.apache.org by "Jonathan Ellis (Created) (JIRA)" <ji...@apache.org> on 2012/03/07 21:46:58 UTC
[jira] [Created] (CASSANDRA-4012) Load-from-flat-file data import tool
Load-from-flat-file data import tool
------------------------------------
Key: CASSANDRA-4012
URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
Project: Cassandra
Issue Type: New Feature
Components: Tools
Reporter: Jonathan Ellis
Assignee: paul cannon
Fix For: 1.1.1
Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line. (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-4012) Load-from-flat-file data import tool
Posted by "Brandon Williams (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13294638#comment-13294638 ]
Brandon Williams commented on CASSANDRA-4012:
---------------------------------------------
A couple things:
* the help for COPY is wrong: you can't use AND in the column list
* it would be nice if it output the number of rows created and perhaps the time it took at the end, just so the user knows something happened
Finally, since we're targeting 'just want to play with it' users, maybe a full example in examples/ would be useful.
> Load-from-flat-file data import tool
> ------------------------------------
>
> Key: CASSANDRA-4012
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
> Project: Cassandra
> Issue Type: New Feature
> Components: Tools
> Reporter: Jonathan Ellis
> Assignee: paul cannon
> Labels: cqlsh
> Fix For: 1.1.2
>
> Attachments: 4012.patch.txt
>
>
> Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line. (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)
[jira] [Commented] (CASSANDRA-4012) Load-from-flat-file data import tool
Posted by "Jonathan Ellis (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13224697#comment-13224697 ]
Jonathan Ellis commented on CASSANDRA-4012:
-------------------------------------------
I'm fine doing this over python/cql, since we already ship that driver in-tree for cqlsh. I also don't think it needs to be ultra-high-performance, since by exporting it to a text file first you're basically implying that:
1. performance isn't super critical, or you'd write a custom source -> cassandra loader and skip the intermediate text file step
2. you're not concerned with huge data volumes
So basically our target use case is "want to play with some toy data" or importing small datasets.
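For illustration only, here is a rough sketch of the kind of small-scale loader described above, using just the Python standard library. It is not the code from the attached patch. The function name import_rows, the execute callable (a stand-in for whatever the CQL driver provides), and the "?" placeholder style are all assumptions made for the example.

```python
import csv


def import_rows(lines, table, columns, execute, delimiter=","):
    """Send one INSERT per delimited line to `execute`; return the row count.

    `execute` is a hypothetical callable taking (statement, values); it
    stands in for a real driver call and is not part of any Cassandra API.
    """
    placeholders = ", ".join("?" for _ in columns)
    statement = "INSERT INTO %s (%s) VALUES (%s)" % (
        table, ", ".join(columns), placeholders)
    count = 0
    for record in csv.reader(lines, delimiter=delimiter):
        if not record:
            continue  # skip blank lines
        if len(record) != len(columns):
            raise ValueError("expected %d fields, got %d: %r"
                             % (len(columns), len(record), record))
        execute(statement, record)
        count += 1
    return count
```

Passing delimiter="\t" would handle the tab-delimited case. Rows are sent one at a time, which fits the "toy data" target rather than bulk loading.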
[jira] [Updated] (CASSANDRA-4012) Load-from-flat-file data import tool
Posted by "paul cannon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
paul cannon updated CASSANDRA-4012:
-----------------------------------
Attachment: 4012.patch-2.txt
Fixed help text and added a message reporting number of rows imported and time elapsed.
Updated the 4012 branch. New submission tagged pending/4012-2.
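As a hedged illustration of the kind of summary message mentioned above (not the actual patch code), the row count and elapsed time could be gathered roughly like this. The names timed_import and insert_row and the exact message wording are hypothetical.

```python
import time


def timed_import(rows, insert_row, clock=time.perf_counter):
    """Insert every row, then return a one-line summary for the user.

    `insert_row` is a hypothetical callable that writes a single row;
    `clock` is injectable so the timing can be checked deterministically.
    """
    start = clock()
    count = 0
    for row in rows:
        insert_row(row)
        count += 1
    elapsed = clock() - start
    return "%d rows imported in %.3f seconds." % (count, elapsed)
```

Printing the returned string at the end of the import gives the user confirmation that something happened.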
[jira] [Updated] (CASSANDRA-4012) Load-from-flat-file data import tool
Posted by "paul cannon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
paul cannon updated CASSANDRA-4012:
-----------------------------------
Attachment: 4012.patch.txt
Patch attached, or see the 4012 branch in my github. Current version tagged as pending/4012.
https://github.com/thepaul/cassandra/tree/4012
[jira] [Updated] (CASSANDRA-4012) Load-from-flat-file data import tool
Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jonathan Ellis updated CASSANDRA-4012:
--------------------------------------
Reviewer: brandon.williams