Posted to dev@pig.apache.org by "Ivan A. Veselovsky (JIRA)" <ji...@apache.org> on 2012/09/07 20:17:07 UTC

[jira] [Updated] (PIG-2898) Multithreaded execution of e2e tests

     [ https://issues.apache.org/jira/browse/PIG-2898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ivan A. Veselovsky updated PIG-2898:
------------------------------------

    Patch Info: Patch Available

We have added a parallel mode for e2e test execution, implemented with Parallel::ForkManager.
Two parameters control the behavior:
1) file.fork.factor -- max number of subprocesses used to run test configuration (.conf) files;
2) fork.factor -- max number of subprocesses used to run tests within one .conf file.
The total number of subprocesses cannot exceed the product of the two values.
A value <= 1 means no parallelization.
Example: ant -Dfork.factor=3 -Dfile.fork.factor=3 ... test-e2e
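
The way the two limits combine can be sketched in a few lines of Python. This is only an illustration, not code from the patch (the harness itself is Perl): the function names are hypothetical, and treating any value <= 1 as a single sequential slot is an assumption drawn from the rule above.

```python
def effective_factor(value):
    """Return the number of parallel slots a fork factor allows.

    Assumption: any value <= 1 means "no parallelization", which is
    modeled here as exactly one sequential slot.
    """
    return max(1, int(value))


def max_concurrent_tests(file_fork_factor, fork_factor):
    """Upper bound on tests running at once.

    Up to `file_fork_factor` .conf files run concurrently, and each of
    them runs up to `fork_factor` tests concurrently, so the bound is
    the product of the two effective factors.
    """
    return effective_factor(file_fork_factor) * effective_factor(fork_factor)


if __name__ == "__main__":
    # Mirrors the ant example: -Dfork.factor=3 -Dfile.fork.factor=3
    print(max_concurrent_tests(3, 3))
```

With the values from the ant example the bound is 9. Setting either factor to 1 or less leaves only the other level parallel.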

The attached patch should be applied to the branch-0.9 branch: http://svn.apache.org/repos/asf/pig/branches/branch-0.9/

The patch testing procedure reports the following results:
     [exec] -1 overall.  
     [exec] 
     [exec]     +1 @author.  The patch does not contain any @author tags.
     [exec] 
     [exec]     +1 tests included.  The patch appears to include 24 new or modified tests.
     [exec] 
     [exec]     -1 javadoc.  The javadoc tool appears to have generated 1 warning messages.
     [exec] 
     [exec]     +1 javac.  The applied patch does not increase the total number of javac compiler warnings.
     [exec] 
     [exec]     +1 findbugs.  The patch does not introduce any new Findbugs warnings.
     [exec] 
     [exec]     +1 release audit.  The applied patch does not increase the total number of release audit warnings.
     [exec] 
                
> Multithreaded execution of e2e tests
> ------------------------------------
>
>                 Key: PIG-2898
>                 URL: https://issues.apache.org/jira/browse/PIG-2898
>             Project: Pig
>          Issue Type: Improvement
>          Components: e2e harness
>            Reporter: Andrey Klochkov
>            Assignee: Andrey Klochkov
>
> Today it takes ~19 hours to run the full set of e2e tests in mapred mode. The bottleneck is on the client side, and per our observations it would help a lot if the e2e harness could run tests in parallel threads.
> We prototyped changes to the e2e harness that allow tests to run in a configurable number of threads. Preliminary results show a more than 6x reduction in execution time on a small 3-node M/R cluster with a modest configuration. We will share a patch shortly.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira