You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Bing Li (JIRA)" <ji...@apache.org> on 2014/04/25 11:43:19 UTC
[jira] [Updated] (HIVE-3574) Allow Hive to Submit MapReduce jobs
via the MapReduce API (instead of using Hadoop BIN)
[ https://issues.apache.org/jira/browse/HIVE-3574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Bing Li updated HIVE-3574:
--------------------------
Assignee: (was: Bing Li)
> Allow Hive to Submit MapReduce jobs via the MapReduce API (instead of using Hadoop BIN)
> ---------------------------------------------------------------------------------------
>
> Key: HIVE-3574
> URL: https://issues.apache.org/jira/browse/HIVE-3574
> Project: Hive
> Issue Type: Improvement
> Components: Query Processor, SQL
> Affects Versions: 0.3.0, 0.4.0, 0.4.1, 0.5.0, 0.6.0, 0.7.0, 0.7.1, 0.8.0, 0.8.1, 0.9.0, 0.9.1, 0.10.0
> Environment: All environments would be affected by this
> Reporter: Jeremy A. Lucas
> Priority: Minor
> Labels: feature, test
>
> The current behavior of the MapRedTask is to start a process that invokes the "hadoop jar" command, passing each additional jobconf property as an argument to this Hadoop CLI.
> Having Hive to submit generated jobs to an M/R cluster via the MapReduce API would allow for potentially greater compatibility across platforms, in addition to allowing for these jobs to be run easily against pseudo-clusters in tests (think MiniMRCluster).
> This kind of change could involve something as simple as using a Hadoop Configuration object with a generic ToolRunner or something similar to run jobs.
> Specifically, this kind of change would most likely occur in the execute() method of org.apache.hadoop.hive.ql.exec.MapRedTask.
--
This message was sent by Atlassian JIRA
(v6.2#6252)