Posted to dev@hive.apache.org by "wolfgang hoschek (JIRA)" <ji...@apache.org> on 2013/11/25 20:12:35 UTC

[jira] [Created] (HIVE-5881) Integrate Hive with Morphlines

wolfgang hoschek created HIVE-5881:
--------------------------------------

             Summary: Integrate Hive with Morphlines
                 Key: HIVE-5881
                 URL: https://issues.apache.org/jira/browse/HIVE-5881
             Project: Hive
          Issue Type: New Feature
          Components: UDF
    Affects Versions: 0.12.0
            Reporter: wolfgang hoschek


Integrate Hive with morphlines. 

Specifically, add support for calling a morphline from Hive as a UDTF, so that Hive can leverage existing morphline ETL functionality. Input data often needs some flexible massaging to get it into the desired shape, and morphline logic does this in a user-friendly, pluggable, efficient, pipelined manner. The work boils down to three steps: transform an input row into a morphline record, feed the record into the morphline processing API, and convert the zero or more morphline output records into corresponding Hive rows.
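
To make the row-to-record-to-rows flow concrete, here is a minimal, self-contained Java sketch. It does not use the real Hive or morphlines classes: SketchRecord, Morphline, SPLIT_ON_COMMA and the helper methods are hypothetical stand-ins invented for illustration, and an actual integration would presumably build on Hive's UDTF API and the morphlines Command/Record API instead.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;

// Hypothetical sketch: none of these types are the real Hive or morphlines classes.
final class MorphlineUdtfSketch {

    /** Stand-in for a morphline record: each field name maps to a list of values. */
    static final class SketchRecord {
        final Map<String, List<Object>> fields = new LinkedHashMap<>();

        void put(String name, Object value) {
            fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
        }

        Object first(String name) {
            List<Object> values = fields.get(name);
            return (values == null || values.isEmpty()) ? null : values.get(0);
        }
    }

    /** Stand-in for a morphline: consumes one record and emits zero or more records. */
    interface Morphline {
        void process(SketchRecord input, Consumer<SketchRecord> output);
    }

    /** Step 1: turn an input row into a record, one field per non-null column. */
    static SketchRecord toRecord(String[] columnNames, Object[] row) {
        SketchRecord record = new SketchRecord();
        for (int i = 0; i < columnNames.length; i++) {
            if (row[i] != null) {
                record.put(columnNames[i], row[i]);
            }
        }
        return record;
    }

    /** Step 3: turn an output record back into a row (first value per column, or null). */
    static Object[] toRow(String[] outputColumns, SketchRecord record) {
        Object[] row = new Object[outputColumns.length];
        for (int i = 0; i < outputColumns.length; i++) {
            row[i] = record.first(outputColumns[i]);
        }
        return row;
    }

    /** Steps 1-3 combined: one input row in, zero or more output rows out. */
    static List<Object[]> process(Morphline morphline, String[] inputColumns,
                                  Object[] row, String[] outputColumns) {
        List<Object[]> rows = new ArrayList<>();
        // Step 2: feed the record to the morphline and collect whatever it emits.
        morphline.process(toRecord(inputColumns, row),
                          out -> rows.add(toRow(outputColumns, out)));
        return rows;
    }

    /** Example morphline: splits the "line" field on commas, one record per non-empty token. */
    static final Morphline SPLIT_ON_COMMA = (input, output) -> {
        Object line = input.first("line");
        if (line == null) {
            return; // nothing to emit for this row
        }
        for (String token : line.toString().split(",")) {
            String trimmed = token.trim();
            if (trimmed.isEmpty()) {
                continue;
            }
            SketchRecord out = new SketchRecord();
            out.put("token", trimmed);
            output.accept(out);
        }
    };

    public static void main(String[] args) {
        List<Object[]> rows = process(SPLIT_ON_COMMA,
                new String[] {"line"}, new Object[] {"a, b,c"}, new String[] {"token"});
        for (Object[] r : rows) {
            System.out.println(r[0]);
        }
    }
}
```

The callback passed to the morphline plays the role a UDTF's row-forwarding mechanism would play in Hive: the morphline decides how many records to emit, and the wrapper only converts between the two representations.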

Some background is here:

http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html

http://cloudera.github.io/cdk/docs/current/cdk-morphlines/index.html


