Posted to commits@cassandra.apache.org by "Jeremy Hanna (JIRA)" <ji...@apache.org> on 2010/06/15 20:01:38 UTC

[jira] Created: (CASSANDRA-1193) Make hadoop functionality available to languages other than Java

Make hadoop functionality available to languages other than Java
----------------------------------------------------------------

                 Key: CASSANDRA-1193
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1193
             Project: Cassandra
          Issue Type: Improvement
          Components: Hadoop
            Reporter: Jeremy Hanna


Currently the Hadoop functionality in Cassandra, specifically the MapReduce hooks, is only available via Java code.

It would be nice to make that functionality available to other languages, perhaps via Hadoop Streaming, which would allow any language that can use STDIN/STDOUT to use MapReduce.

It seems like this approach could be realized with a small amount of code that converts the inputs and outputs to strings.
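
As a rough illustration of the streaming idea above, here is a minimal sketch of a mapper in Python. It assumes a line format that nothing in this issue defines: a row key followed by tab-separated name=value column pairs. The function names are likewise made up for the example.

```python
import sys


def map_line(line):
    """Turn one text row into (row_key, column_count).

    Assumed (hypothetical) line format: row_key<TAB>name=value<TAB>name=value...
    """
    fields = line.rstrip("\n").split("\t")
    row_key, columns = fields[0], fields[1:]
    return row_key, len(columns)


def run_mapper(stdin, stdout):
    """Read rows from stdin and write key<TAB>count lines to stdout."""
    for line in stdin:
        if not line.strip():
            continue  # skip blank lines
        row_key, count = map_line(line)
        stdout.write(f"{row_key}\t{count}\n")


if __name__ == "__main__":
    run_mapper(sys.stdin, sys.stdout)
```

A script like this can be tried locally by piping a text file into it before handing it to Hadoop Streaming as the mapper. Any language that reads STDIN and writes STDOUT could take its place.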

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (CASSANDRA-1193) Make hadoop functionality available to languages other than Java

Posted by "Jeremy Hanna (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jeremy Hanna updated CASSANDRA-1193:
------------------------------------

    Priority: Minor  (was: Major)

> Make hadoop functionality available to languages other than Java
> ----------------------------------------------------------------
>
>                 Key: CASSANDRA-1193
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1193
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop
>            Reporter: Jeremy Hanna
>            Priority: Minor
>
> Currently the Hadoop functionality in Cassandra, specifically the MapReduce hooks, is only available via Java code.
> It would be nice to make that functionality available to other languages, perhaps via Hadoop Streaming, which would allow any language that can use STDIN/STDOUT to use MapReduce.
> It seems like this approach could be realized with a small amount of code that converts the inputs and outputs to strings.



[jira] Resolved: (CASSANDRA-1193) Make hadoop functionality available to languages other than Java

Posted by "Stu Hood (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Stu Hood resolved CASSANDRA-1193.
---------------------------------

    Resolution: Duplicate

I guess CASSANDRA-1368 handles this.




[jira] Issue Comment Edited: (CASSANDRA-1193) Make hadoop functionality available to languages other than Java

Posted by "Stu Hood (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12879376#action_12879376 ] 

Stu Hood edited comment on CASSANDRA-1193 at 6/16/10 12:52 PM:
---------------------------------------------------------------

One solution would be to implement a TextColumnFamilyInputFormat that wraps ColumnFamilyInputFormat and converts the data structures it outputs into JSON/TSV/CSV.

EDIT: Actually, rather than text-based serialization, wrapping the data structures in Avro/Thrift structs might be much more performant.
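
To make the text-conversion idea concrete, the following Python sketch shows the kind of row-to-line conversion such a wrapper might perform. The actual wrapper would be Java code around ColumnFamilyInputFormat. The row shape used here (a key plus a name-to-value mapping) and the function names are assumptions for illustration only. The sketch covers just the text variant, not the Avro/Thrift one.

```python
import json


def row_to_json(row_key, columns):
    """Serialize one row as a single JSON line (keys sorted for stable output)."""
    return json.dumps({"key": row_key, "columns": columns}, sort_keys=True)


def row_to_tsv(row_key, columns):
    """Serialize one row as key<TAB>name=value... with columns sorted by name.

    Note: this naive form does not escape tabs, newlines or '=' in the data.
    """
    pairs = [f"{name}={value}" for name, value in sorted(columns.items())]
    return "\t".join([row_key] + pairs)
```

The missing escaping in the TSV form hints at one cost of text serialization: arbitrary column bytes need quoting rules that a binary struct format would not.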

      was (Author: stuhood):
    One solution would be to implement a TextColumnFamilyInputFormat that wraps ColumnFamilyInputFormat and converts the data structures it outputs into JSON/TSV/CSV.
  



[jira] Commented: (CASSANDRA-1193) Make hadoop functionality available to languages other than Java

Posted by "Stu Hood (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12879376#action_12879376 ] 

Stu Hood commented on CASSANDRA-1193:
-------------------------------------

One solution would be to implement a TextColumnFamilyInputFormat that wraps ColumnFamilyInputFormat and converts the data structures it outputs into JSON/TSV/CSV.

