
Posted to issues@nifi.apache.org by "Matt Burgess (JIRA)" <ji...@apache.org> on 2016/11/14 14:27:58 UTC

[jira] [Created] (NIFI-3031) Support Multi-Statement Scripts in the PutHiveQL Processor

Matt Burgess created NIFI-3031:
----------------------------------

             Summary: Support Multi-Statement Scripts in the PutHiveQL Processor
                 Key: NIFI-3031
                 URL: https://issues.apache.org/jira/browse/NIFI-3031
             Project: Apache NiFi
          Issue Type: Improvement
            Reporter: Matt Burgess


I am trying to use the PutHiveQL processor to execute a HiveQL script that contains multiple statements.

For example:

USE my_database;

FROM my_database_src.base_table
INSERT OVERWRITE TABLE refined_table
SELECT *;

-- or --

use my_database;

create temporary table WORKING as
select a,b,c from RAW;

FROM RAW
INSERT OVERWRITE TABLE refined_table
SELECT *;

The current implementation does not even accept a trailing semicolon at the end of a single statement.

The processor should either use a default delimiter, such as a semicolon, to mark statement boundaries within the file, or allow users to define their own.
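
As a rough illustration of the delimiter idea, the sketch below splits a script into statements on a configurable delimiter character. It is not the PutHiveQL implementation; the class name HiveScriptSplitter and its split method are hypothetical. It assumes the delimiter is a single character, skips delimiters that appear inside single- or double-quoted strings, and drops empty statements so that a trailing semicolon is harmless. It does not handle "--" comments.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of NiFi: splits a HiveQL script into statements.
public final class HiveScriptSplitter {

    private HiveScriptSplitter() {}

    // Splits on the delimiter, ignoring delimiters inside quoted strings.
    public static List<String> split(String script, char delimiter) {
        List<String> statements = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        char quote = 0; // 0 means "not inside a quoted string"

        for (int i = 0; i < script.length(); i++) {
            char c = script.charAt(i);
            if (quote != 0) {
                current.append(c);
                if (c == '\\' && i + 1 < script.length()) {
                    current.append(script.charAt(++i)); // keep the escaped character verbatim
                } else if (c == quote) {
                    quote = 0; // closing quote
                }
            } else if (c == '\'' || c == '"') {
                quote = c; // opening quote
                current.append(c);
            } else if (c == delimiter) {
                addIfNotBlank(statements, current);
                current.setLength(0);
            } else {
                current.append(c);
            }
        }
        addIfNotBlank(statements, current); // last statement may lack a delimiter
        return statements;
    }

    private static void addIfNotBlank(List<String> out, StringBuilder sb) {
        String s = sb.toString().trim();
        if (!s.isEmpty()) {
            out.add(s);
        }
    }
}
```

Calling HiveScriptSplitter.split(script, ';') on the first example above would yield two statements, the USE statement and the FROM ... INSERT ... SELECT statement, each without its trailing semicolon.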

This makes pipelines testable, because the HiveQL can be sourced from files rather than embedded in a product, and those scripts can be complex. Statements should run sequentially, in order, within the same JDBC session so that session-scoped features such as temporary tables work.
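
The same-session requirement could be sketched as follows, using only the standard java.sql API. The class name SequentialScriptRunner and the method runAll are hypothetical, and the sketch assumes the caller already holds an open Connection to Hive and a list of statements that has already been split. Every statement runs in order on that one Connection, so a temporary table created by an earlier statement should still be visible to later ones.

```java
import java.sql.Connection;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.List;

// Hypothetical helper, not part of NiFi: runs statements in order on one connection.
public final class SequentialScriptRunner {

    private SequentialScriptRunner() {}

    // Executes each statement in list order; returns how many were executed.
    // The first failure propagates as an SQLException and stops the run.
    public static int runAll(Connection conn, List<String> statements) throws SQLException {
        int executed = 0;
        try (Statement stmt = conn.createStatement()) {
            for (String sql : statements) {
                stmt.execute(sql); // same Connection, hence same session
                executed++;
            }
        }
        return executed;
    }
}
```

Stopping at the first failure is one possible design choice; it avoids running later statements against a state that an earlier statement failed to set up.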



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)