You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Dishara Wijewardana (JIRA)" <ji...@apache.org> on 2013/03/16 18:12:13 UTC

[jira] [Commented] (PIG-3225) Stratified sampling

    [ https://issues.apache.org/jira/browse/PIG-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13604325#comment-13604325 ] 

Dishara Wijewardana commented on PIG-3225:
------------------------------------------

Hi Gianmarco
I am Dishara who took part in previous GSoC 2012 in Apache Velocity project and successfully completed the JSR 223 implementation. I would like to contribute to the PIG project since it seems pretty interesting. As far as I understand this project idea is basically to implement a tolerable Stratified sampling algorithm on top of PIG. Correct me If I am wrong. Can you provide a bit more details of what aspects I need to look in and get in to this. (like what exactly expected eventually, so that may be I can provide potential algorithm as a patch to simulate this probably before the proposal)  
                
> Stratified sampling
> -------------------
>
>                 Key: PIG-3225
>                 URL: https://issues.apache.org/jira/browse/PIG-3225
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Gianmarco De Francisci Morales
>              Labels: gsoc2013
>
> Implement a stratified sampling option ( http://en.wikipedia.org/wiki/Stratified_sampling ) in Pig's SAMPLE operator.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira