You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Rohini Palaniswamy (JIRA)" <ji...@apache.org> on 2015/02/20 21:24:12 UTC

[jira] [Created] (PIG-4427) Log a warning if the PARALLEL specified is not a prime number

Rohini Palaniswamy created PIG-4427:
---------------------------------------

             Summary: Log a warning if the PARALLEL specified is not a prime number
                 Key: PIG-4427
                 URL: https://issues.apache.org/jira/browse/PIG-4427
             Project: Pig
          Issue Type: Improvement
            Reporter: Rohini Palaniswamy


  Most of the time users specify default_parallel or PARALLEL in multiples of 10. This causes data skew and is not that effective. For eg: Had a user specify 1000 and all records went into 1 reducer, before it was changed to 999. But in some cases where user wants exact number of output files a non-prime number is desired. We should log a warning message if we see a non-prime number, so that it at least makes the user re-look and change config if it was not intentional and was done for sake of ease.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)