You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@crunch.apache.org by "Josh Wills (JIRA)" <ji...@apache.org> on 2014/08/01 06:58:39 UTC

[jira] [Updated] (CRUNCH-449) Add sequentialDo function for injecting arbitrary non-parallel code

     [ https://issues.apache.org/jira/browse/CRUNCH-449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Wills updated CRUNCH-449:
------------------------------

    Attachment: CRUNCH-449d.patch

And now the Spark impl is done. Probably ready for another look.

> Add sequentialDo function for injecting arbitrary non-parallel code
> -------------------------------------------------------------------
>
>                 Key: CRUNCH-449
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-449
>             Project: Crunch
>          Issue Type: Bug
>          Components: Core
>            Reporter: Josh Wills
>            Assignee: Josh Wills
>         Attachments: CRUNCH-449.patch, CRUNCH-449b.patch, CRUNCH-449c.patch, CRUNCH-449d.patch
>
>
> I've been noodling on this one for awhile: how to add the ability to execute some code if and only if one or more targets are created, and have that executed code (optionally) return one or more new PCollections as a result. I was thinking that this functionality could be wired in to libraries to do things like bulk loading HBase tables or running Sqoop jobs as part of Crunch pipelines automatically.



--
This message was sent by Atlassian JIRA
(v6.2#6252)