You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Ted Dunning (JIRA)" <ji...@apache.org> on 2010/08/04 18:44:24 UTC

[jira] Commented: (MAPREDUCE-1849) Implement a FlumeJava-like library for operations over parallel collections using Hadoop MapReduce

    [ https://issues.apache.org/jira/browse/MAPREDUCE-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12895314#action_12895314 ] 

Ted Dunning commented on MAPREDUCE-1849:
----------------------------------------

The github implementation of Plume now supports local evaluation in an eager as opposed to lazy fashion.  Avro file reading is working in at least one example.  The execution plan optimizer is beginning to work.  We have an emulated map-reduce framework working to support mocking up the full map-reduce execution.  Nobody is working on the Hadoop interface yet so if there is a volunteer for that, they would be very welcome.

Anybody who would like to contribute is welcome as long as they are willing to Apache license their contributions.

See http://github.com/tdunning/Plume for the source,  http://tdunning.blogspot.com/2010/07/new-grool.html for some discussion.

> Implement a FlumeJava-like library for operations over parallel collections using Hadoop MapReduce
> --------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1849
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1849
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>            Reporter: Jeff Hammerbacher
>
> The API used internally at Google is described in great detail at http://portal.acm.org/citation.cfm?id=1806596.1806638.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.