Posted to dev@pig.apache.org by "Aniket Mokashi (JIRA)" <ji...@apache.org> on 2014/01/16 23:03:25 UTC

[jira] [Comment Edited] (PIG-3463) Pig should use hadoop local mode for small jobs

    [ https://issues.apache.org/jira/browse/PIG-3463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13872724#comment-13872724 ] 

Aniket Mokashi edited comment on PIG-3463 at 1/16/14 10:02 PM:
---------------------------------------------------------------

Tried a few scenarios with the patch:
# Load small data, group all, count - works in local mode.
# Load small data, another small data and replicated join - works in local mode.
# Load small data and order by key - all 3 jobs work in local mode.
# Load small data and large data for replicated join - first job runs in local mode, second runs in MR mode.
# Load large data and order by key - the first stage runs in MR mode and the last stages run in local mode.
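
The mixed results in the last two scenarios are consistent with the mode being chosen per job rather than per script. A minimal sketch of such a per-job decision follows. It is not the patch's code: the Job class, the function names, the 100 MB threshold and the reducer limit are all hypothetical and only illustrate "few reducers and few mb of data".

```python
from dataclasses import dataclass

MB = 1024 * 1024

# Hypothetical threshold: the issue only says "few mb of data".
SMALL_INPUT_MAX_BYTES = 100 * MB


@dataclass
class Job:
    name: str
    input_bytes: int   # total size of this job's inputs
    reducers: int = 1  # number of reduce tasks requested


def pick_mode(job, max_bytes=SMALL_INPUT_MAX_BYTES, max_reducers=1):
    """Return "local" for a small job, "mr" otherwise (illustrative rule only)."""
    small = job.input_bytes <= max_bytes and job.reducers <= max_reducers
    return "local" if small else "mr"


def plan_modes(jobs):
    """Decide the mode of each job in a multi-job script independently."""
    return [(job.name, pick_mode(job)) for job in jobs]


# Scenario 4: the first job reads only small data, the second reads large data.
# The job names and sizes here are made up for the example.
scenario_4 = [
    Job("first job", input_bytes=2 * MB),
    Job("second job", input_bytes=5000 * MB),
]
print(plan_modes(scenario_4))
```

Under these assumed sizes the first job is planned for local mode and the second for MR mode, which matches the behaviour reported for scenario 4.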


was (Author: aniket486):
Tried few scenarios with the patch-
# Load small data, group all, count - works in local mode.
# Load small data, another small data and replicated join - works in local mode.
# Load small data and order by key - all 3 jobs work in local mode and .
# Load small data and large data for replicated join - first job runs in local mode, second runs in MR mode.
# Load large data and order by key - works in first stages in local mode and last stage in MR mode.

> Pig should use hadoop local mode for small jobs
> -----------------------------------------------
>
>                 Key: PIG-3463
>                 URL: https://issues.apache.org/jira/browse/PIG-3463
>             Project: Pig
>          Issue Type: New Feature
>          Components: impl
>    Affects Versions: 0.11.1
>            Reporter: Aniket Mokashi
>            Assignee: Aniket Mokashi
>             Fix For: 0.13.0
>
>         Attachments: PIG-3463-1.patch, PIG-3463-3.patch
>
>
> Pig should use Hadoop local mode for small jobs: few mappers, few reducers, and a few MB of data.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)