You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Jonathan Ellis (Created) (JIRA)" <ji...@apache.org> on 2012/03/07 21:46:58 UTC

[jira] [Created] (CASSANDRA-4012) Load-from-flat-file data import tool

Load-from-flat-file data import tool
------------------------------------

                 Key: CASSANDRA-4012
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
             Project: Cassandra
          Issue Type: New Feature
          Components: Tools
            Reporter: Jonathan Ellis
            Assignee: paul cannon
             Fix For: 1.1.1


Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line.  (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (CASSANDRA-4012) Load-from-flat-file data import tool

Posted by "Brandon Williams (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13294638#comment-13294638 ] 

Brandon Williams commented on CASSANDRA-4012:
---------------------------------------------

A couple things:

* the help for COPY is wrong, you can't use AND in the column list
* it would be nice if it output the number of rows created and perhaps the time it took at the end, just so the user knows something happened

Finally, since we're targeting 'just want to play with it' users, maybe a full example in examples/ would be useful.
                
> Load-from-flat-file data import tool
> ------------------------------------
>
>                 Key: CASSANDRA-4012
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Tools
>            Reporter: Jonathan Ellis
>            Assignee: paul cannon
>              Labels: cqlsh
>             Fix For: 1.1.2
>
>         Attachments: 4012.patch.txt
>
>
> Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line.  (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (CASSANDRA-4012) Load-from-flat-file data import tool

Posted by "Jonathan Ellis (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13224697#comment-13224697 ] 

Jonathan Ellis commented on CASSANDRA-4012:
-------------------------------------------

I'm fine doing this over python/cql, since we already ship that driver in-tree for cqlsh.  I also don't think it needs to be ultra-high-performance, since by exporting it to a text file first you're basically implying that

# performance isn't super critical, or you'd write a custom source -> cassandra loader and skip the intermediate text file step
# you're not concerned with huge data volumes

So basically our target use case is "want to play with some toy data" or importing small datasets.
                
> Load-from-flat-file data import tool
> ------------------------------------
>
>                 Key: CASSANDRA-4012
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Tools
>            Reporter: Jonathan Ellis
>            Assignee: paul cannon
>             Fix For: 1.1.1
>
>
> Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line.  (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (CASSANDRA-4012) Load-from-flat-file data import tool

Posted by "paul cannon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

paul cannon updated CASSANDRA-4012:
-----------------------------------

    Attachment: 4012.patch-2.txt

Fixed help text and added a message reporting number of rows imported and time elapsed.

Updated the 4012 branch. New submission tagged pending/4012-2.
                
> Load-from-flat-file data import tool
> ------------------------------------
>
>                 Key: CASSANDRA-4012
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Tools
>            Reporter: Jonathan Ellis
>            Assignee: paul cannon
>              Labels: cqlsh
>             Fix For: 1.1.2
>
>         Attachments: 4012.patch-2.txt, 4012.patch.txt
>
>
> Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line.  (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (CASSANDRA-4012) Load-from-flat-file data import tool

Posted by "paul cannon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

paul cannon updated CASSANDRA-4012:
-----------------------------------

    Attachment: 4012.patch.txt

Patch attached, or see the 4012 branch in my github. Current version tagged as pending/4012.

https://github.com/thepaul/cassandra/tree/4012
                
> Load-from-flat-file data import tool
> ------------------------------------
>
>                 Key: CASSANDRA-4012
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Tools
>            Reporter: Jonathan Ellis
>            Assignee: paul cannon
>              Labels: cqlsh
>             Fix For: 1.1.2
>
>         Attachments: 4012.patch.txt
>
>
> Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line.  (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (CASSANDRA-4012) Load-from-flat-file data import tool

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis updated CASSANDRA-4012:
--------------------------------------

    Reviewer: brandon.williams
    
> Load-from-flat-file data import tool
> ------------------------------------
>
>                 Key: CASSANDRA-4012
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4012
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Tools
>            Reporter: Jonathan Ellis
>            Assignee: paul cannon
>              Labels: cqlsh
>             Fix For: 1.1.2
>
>         Attachments: 4012.patch.txt
>
>
> Would like to be able to import data from a comma-delimited or tab-delimited file, one "row" per line.  (That is, "row" can be "logical row" as in http://www.datastax.com/dev/blog/schema-in-cassandra-1-1.)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira