You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Ning Zhang (JIRA)" <ji...@apache.org> on 2010/09/21 19:04:32 UTC

[jira] Commented: (HIVE-1659) parse_url_tuple: a UDTF version of parse_url

    [ https://issues.apache.org/jira/browse/HIVE-1659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12913081#action_12913081 ] 

Ning Zhang commented on HIVE-1659:
----------------------------------

parse_url currently support 2 signatures: parse_url(fullurl, '[QUERY|PATH|HOST|...]') and parse_url(fullurl, 'QUERY', '[ref|sk|...]'). In parse_url_tuple, the syntax is consolidated as parse_url_tuple(fullurl, 'HOST', 'PATH', 'QUERY:ref', 'QUERY:sk',...). 

> parse_url_tuple:  a UDTF version of parse_url
> ---------------------------------------------
>
>                 Key: HIVE-1659
>                 URL: https://issues.apache.org/jira/browse/HIVE-1659
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Ning Zhang
>
> The UDF parse_url take s a URL, parse it and extract QUERY/PATH etc from it. However it can only extract an atomic value from the URL. If we want to extract multiple piece of information, we need to call the function many times. It is desirable to parse the URL once and extract all needed information and return a tuple in a UDTF. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.