You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Chengxiang Li (JIRA)" <ji...@apache.org> on 2014/08/11 08:31:12 UTC
[jira] [Created] (HIVE-7675) Implement native HiveMapFunction
Chengxiang Li created HIVE-7675:
-----------------------------------
Summary: Implement native HiveMapFunction
Key: HIVE-7675
URL: https://issues.apache.org/jira/browse/HIVE-7675
Project: Hive
Issue Type: New Feature
Components: Spark
Reporter: Chengxiang Li
Currently, Hive on Spark depend on ExecMapper to execute operator logic, full stack is like: Spark FrameWork=>HiveMapFunction=>ExecMapper=>Hive operators. HiveMapFunction is just a thin wrapper of ExecMapper, this introduce several problems as following:
# ExecMapper is designed for MR single process task mode, it does not work well under Spark multi-thread task node.
# ExecMapper introduce extra API level restriction.
We need implement native HiveMapFunction, as the bridge between Spark framework and Hive operators.
--
This message was sent by Atlassian JIRA
(v6.2#6252)