You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Mohit Sabharwal (JIRA)" <ji...@apache.org> on 2015/05/13 03:49:00 UTC
[jira] [Created] (PIG-4549) Set CROSS operation parallelism for
Spark engine
Mohit Sabharwal created PIG-4549:
------------------------------------
Summary: Set CROSS operation parallelism for Spark engine
Key: PIG-4549
URL: https://issues.apache.org/jira/browse/PIG-4549
Project: Pig
Issue Type: Sub-task
Components: spark
Affects Versions: spark-branch
Reporter: Mohit Sabharwal
Assignee: Mohit Sabharwal
Fix For: spark-branch
Spark engine should set parallelism to be used for CROSS operation by GFCross UDF.
If not set, GFCross throws an exception:
{code}
String s = cfg.get(PigImplConstants.PIG_CROSS_PARALLELISM + "." + crossKey);
if (s == null) {
throw new IOException("Unable to get parallelism hint from job conf");
}
{code}
Estimating parallelism for Spark engine is a TBD item. Until that is done, for CROSS to work, we should use the default parallelism value in GFCross.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)