Posted to dev@pig.apache.org by "Daniel Dai (JIRA)" <ji...@apache.org> on 2016/06/23 18:39:16 UTC

[jira] [Resolved] (PIG-2348) Bloom should be able to take a relation or a file

     [ https://issues.apache.org/jira/browse/PIG-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Dai resolved PIG-2348.
-----------------------------
    Resolution: Duplicate

> Bloom should be able to take a relation or a file
> -------------------------------------------------
>
>                 Key: PIG-2348
>                 URL: https://issues.apache.org/jira/browse/PIG-2348
>             Project: Pig
>          Issue Type: Improvement
>          Components: internal-udfs
>    Affects Versions: 0.11
>            Reporter: Alan Gates
>            Assignee: Alan Gates
>            Priority: Minor
>         Attachments: PIG-2348-0.patch
>
>
> Currently, Bloom requires the user to have stored the result of a previous query that uses BuildBloom in an HDFS file before it can be used. This means the user must issue an "exec" between that store and the filter that uses Bloom if both are in the same script.
> If Bloom could take a relation as its first input (à la the relation cast to scalar), users would not need to put an exec in their script or manage a storage location on HDFS.
> Sometimes storing the results in a file makes sense, so we don't want to remove the current behavior, just add another option.
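
For context, the current file-based workflow described above can be sketched roughly as follows. This is an illustrative script, not taken from the issue or the attached patch: the input paths 'S' and 'L', the field names, the 'mybloom' location, and the BuildBloom arguments (hash type, expected number of elements, desired false-positive rate) are all assumptions chosen for the example.

```pig
-- Hypothetical sketch of the current behavior: build the filter, store it,
-- force execution, then load it by file name.
-- Assumed constructor arguments: hash type, expected element count, false-positive rate.
define bb BuildBloom('jenkins', '1000', '0.01');

small = load 'S' as (x, y, z);
grpd  = group small all;
fltrd = foreach grpd generate bb(small.x);

-- The filter must be written to HDFS first ('mybloom' is a made-up location).
store fltrd into 'mybloom';

-- Without this exec, the file would not exist when Bloom tries to read it.
exec;

define bloom Bloom('mybloom');
large  = load 'L' as (a, b, c);
flarge = filter large by bloom(a);
joined = join small by x, flarge by a;
store joined into 'results';
```

The store, the exec, and the hard-coded 'mybloom' path are the parts that the requested improvement would make optional.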


