You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "He Yongqiang (JIRA)" <ji...@apache.org> on 2010/09/13 13:40:33 UTC

[jira] Commented: (HIVE-1624) Patch to allows scripts in S3 location

    [ https://issues.apache.org/jira/browse/HIVE-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12908729#action_12908729 ] 

He Yongqiang commented on HIVE-1624:
------------------------------------

S3 -> client -> cluster maybe better than directly downloading the script from S3 to TaskTracker node.
There may be thousands of concurrent downloading request to S3 for downloading a script. (I agree that the script can be cached in local machine, but right now hive does not do any cache clean up).
S3 -> client -> cluster will be able to use hadoop distributed cache.

> Patch to allows scripts in S3 location
> --------------------------------------
>
>                 Key: HIVE-1624
>                 URL: https://issues.apache.org/jira/browse/HIVE-1624
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Vaibhav Aggarwal
>         Attachments: HIVE-1624.patch
>
>
> I want to submit a patch which allows user to run scripts located in S3.
> This patch enables Hive to download the hive scripts located in S3 buckets and execute them. This saves users the effort of copying scripts to HDFS before executing them.
> Thanks
> Vaibhav

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.