Posted to dev@pig.apache.org by "Ivan A. Veselovsky (JIRA)" <ji...@apache.org> on 2012/09/07 20:17:07 UTC
[jira] [Updated] (PIG-2898) Multithreaded execution of e2e tests
[ https://issues.apache.org/jira/browse/PIG-2898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ivan A. Veselovsky updated PIG-2898:
------------------------------------
Patch Info: Patch Available
We have added a parallelized mode for running the e2e tests, implemented with Parallel::ForkManager.
Two parameters control the behavior:
1) file.fork.factor -- max number of subprocesses when running test configuration files (.conf);
2) fork.factor -- max number of subprocesses when running tests within one .conf file.
The total number of subprocesses cannot exceed the product of the two values.
A value <= 1 means no parallelization.
Example: ant -Dfork.factor=3 -Dfile.fork.factor=3 ... test-e2e
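To illustrate how the two factors combine, here is a minimal Python sketch. It is not the code from the patch (which is Perl and forks subprocesses via Parallel::ForkManager); it uses thread pools instead, and the names effective_factor, max_subprocesses, run_conf_file, and run_all are hypothetical, chosen only for this illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def effective_factor(value):
    """A value <= 1 means no parallelization, i.e. a single worker."""
    return value if value > 1 else 1


def max_subprocesses(file_fork_factor, fork_factor):
    """Upper bound on concurrently running tests: the product of the two factors."""
    return effective_factor(file_fork_factor) * effective_factor(fork_factor)


def run_conf_file(tests, fork_factor, run_test):
    """Run the tests of one .conf file with at most fork_factor workers."""
    with ThreadPoolExecutor(max_workers=effective_factor(fork_factor)) as pool:
        # pool.map preserves the input order of the tests in its results.
        return list(pool.map(run_test, tests))


def run_all(conf_files, file_fork_factor, fork_factor, run_test):
    """Run several .conf files with at most file_fork_factor files in flight."""
    with ThreadPoolExecutor(max_workers=effective_factor(file_fork_factor)) as pool:
        futures = {
            name: pool.submit(run_conf_file, tests, fork_factor, run_test)
            for name, tests in conf_files.items()
        }
        return {name: future.result() for name, future in futures.items()}


if __name__ == "__main__":
    # Hypothetical stand-in for real test cases: each "test" is just a number.
    confs = {"a.conf": [1, 2], "b.conf": [3]}
    print(max_subprocesses(3, 3))
    print(run_all(confs, 3, 3, lambda t: t * 2))
```

With both factors set to 3, as in the ant example above, up to 3 .conf files are processed at once and each runs up to 3 tests at once, which gives the bound of 9.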
The attached patch is to be applied to the http://svn.apache.org/repos/asf/pig/branches/branch-0.9/ branch.
The patch testing procedure gives the following results:
[exec] -1 overall.
[exec]
[exec] +1 @author. The patch does not contain any @author tags.
[exec]
[exec] +1 tests included. The patch appears to include 24 new or modified tests.
[exec]
[exec] -1 javadoc. The javadoc tool appears to have generated 1 warning messages.
[exec]
[exec] +1 javac. The applied patch does not increase the total number of javac compiler warnings.
[exec]
[exec] +1 findbugs. The patch does not introduce any new Findbugs warnings.
[exec]
[exec] +1 release audit. The applied patch does not increase the total number of release audit warnings.
[exec]
> Multithreaded execution of e2e tests
> ------------------------------------
>
> Key: PIG-2898
> URL: https://issues.apache.org/jira/browse/PIG-2898
> Project: Pig
> Issue Type: Improvement
> Components: e2e harness
> Reporter: Andrey Klochkov
> Assignee: Andrey Klochkov
>
> Today it takes ~19 hours to run the full set of e2e tests in mapred mode. The bottleneck is on the client side, and per our observations it would help a lot if the e2e harness could run tests in parallel threads.
> We prototyped changes to the e2e harness that allow running tests in a configurable number of threads. Preliminary results show a more than 6x reduction in execution time on a small 3-node M/R cluster with a modest configuration. We will share a patch shortly.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira