You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Alan Gates (JIRA)" <ji...@apache.org> on 2011/02/23 22:58:38 UTC
[jira] Commented: (PIG-1867) Allow UDFs that can generate multiple
output tuples from a single input tuple
[ https://issues.apache.org/jira/browse/PIG-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12998570#comment-12998570 ]
Alan Gates commented on PIG-1867:
---------------------------------
Pig already offers this. Have your UDF return a bag. You can then use flatten to iterate through that bag.
> Allow UDFs that can generate multiple output tuples from a single input tuple
> -----------------------------------------------------------------------------
>
> Key: PIG-1867
> URL: https://issues.apache.org/jira/browse/PIG-1867
> Project: Pig
> Issue Type: New Feature
> Reporter: Mathias Herberts
>
> Hive offers this kind of thing using UDTF (User Defined Table generating Functions), it would be very useful for Pig to offer something similar, thus allowing more complex processing.
> One example of such use could be an n-gram generating function.
> I guess EvalFunc could be adapted/morped so exec returns an Iterator<T> instead of T.
> In a first approach, the iterator scanning could be restricted to cases when the UDF is used alone in a generate clause.
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira