You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/07/06 23:56:11 UTC

[jira] [Commented] (BEAM-428) InProcessRunner - Bundle based local runner

    [ https://issues.apache.org/jira/browse/BEAM-428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15365353#comment-15365353 ] 

ASF GitHub Bot commented on BEAM-428:
-------------------------------------

GitHub user aaltay opened a pull request:

    https://github.com/apache/incubator-beam/pull/598

    [BEAM-428] InProcessRunner - Bundle based local runner

    Be sure to do all of the following to help us incorporate your contribution
    quickly and easily:
    
     - [ ] Make sure the PR title is formatted like:
       `[BEAM-<Jira issue #>] Description of pull request`
     - [ ] Make sure tests pass via `mvn clean verify`. (Even better, enable
           Travis-CI on your fork and ensure the whole test matrix passes).
     - [ ] Replace `<Jira issue #>` in the title with the actual Jira issue
           number, if there is one.
     - [ ] If this contribution is large, please file an Apache
           [Individual Contributor License Agreement](https://www.apache.org/licenses/icla.txt).
    
    ---
    InProcessRunner is a bundle based runner for doing local execution. It will eventually be a drop-in replacement for the DirectRunner.
    
    It executes pipelines by dividing inputs into bundles and running ptransforms in bundles. Similar to its Java counterpart it aims to provide better performance and validation capabilities compared to the current DirectRunner. InProcessRunner is self contained in its own directory. This will be the case until the replacement happens.
    
    Tests mainly consists of wrapping existing DirectRunner tests and executing them against this new runner.
    
    This version supports batch execution only, future updates will add streaming supports.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/aaltay/incubator-beam inproc

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-beam/pull/598.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #598
    
----
commit a559adb978a0762ef2f5df3f089d30b8ea1c9e5f
Author: Ahmet Altay <al...@google.com>
Date:   2016-07-06T22:51:50Z

    In process runner with bundled execution. Executes batch pipeline in
    bundles in parallel.
    
    It has tests for running against existing DirectRunner related tests. It
    is self contained currently in its own folder.

commit e995883828391188791d31c476a3a14ac292bafe
Author: Ahmet Altay <al...@google.com>
Date:   2016-07-06T23:18:15Z

    Workaround for ThreadPool creash in fileio for older version of Python.

----


> InProcessRunner - Bundle based local runner
> -------------------------------------------
>
>                 Key: BEAM-428
>                 URL: https://issues.apache.org/jira/browse/BEAM-428
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-py
>            Reporter: Ahmet Altay
>            Assignee: Ahmet Altay
>
> InProcessRunner is a bundle based drop in replacement for DirectRunner.
> Similar to its Java equivalent it improves DirectRunner by executing transforms in parallel using bundles similar to a service based implementations. It offers better performance and more validation options.
> Initially it will be a runner for executing batch jobs only. The target of this phase is to develop a drop in replacement for DirectRunner. Later it will be improved by adding streaming execution.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)