You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "liyunzhang_intel (JIRA)" <ji...@apache.org> on 2016/10/18 17:11:58 UTC
[jira] [Comment Edited] (PIG-5044) Create
SparlCompiler#getSamplingJob in spark mode
[ https://issues.apache.org/jira/browse/PIG-5044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15585999#comment-15585999 ]
liyunzhang_intel edited comment on PIG-5044 at 10/18/16 5:11 PM:
-----------------------------------------------------------------
[~kexianda]: try to use PIG-5044.patch in Skewed Join optimization(PIG-4858) and help review.
was (Author: kellyzly):
[~kexianda]: Help review PIG-5044.patch
> Create SparlCompiler#getSamplingJob in spark mode
> -------------------------------------------------
>
> Key: PIG-5044
> URL: https://issues.apache.org/jira/browse/PIG-5044
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: liyunzhang_intel
> Assignee: liyunzhang_intel
> Fix For: spark-branch
>
> Attachments: PIG-5044.patch
>
>
> Like MRCompiler#getSamplingJob, we also need a function like that to sample data from a file, sort sampling data and generate output by UDF(org.apache.pig.impl.builtin.FindQuantiles).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)