You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Martin (Jira)" <ji...@apache.org> on 2020/03/12 07:30:00 UTC
[jira] [Created] (NIFI-7247) Unable to execute SQL
Martin created NIFI-7247:
----------------------------
Summary: Unable to execute SQL
Key: NIFI-7247
URL: https://issues.apache.org/jira/browse/NIFI-7247
Project: Apache NiFi
Issue Type: Bug
Components: Core Framework
Affects Versions: 1.11.3
Environment: containerized environment on EC2 (amzn2-ami-hvm-2.0.20191116.0-x86_64-gp2)
Reporter: Martin
Fix For: 1.11.4
Scenario:
We use ExecuteSQL to read delta tables (stored in S3) via JDBC connection to databricks.
Temporary Fix:
If we deactivate and reactivate the controller service, then ExecuteSQL works without problems. What is noticeable here, however, is that it takes quite a long time the first time it is executed and the next time it is executed it is done within 3 seconds.
Background information:
* Howto use Databricks JDBC [https://docs.databricks.com/integrations/bi/jdbc-odbc-bi.html]
* Controller Service DBCPConnectionPool 1.11.3
**
URL: jdbc:spark://#\{databricks.host}...\{databricks.cluster.id};...;PWD=#\{databricks.token}
Driver Class: com.simba.spark.jdbc.Driver
* Table
** one column with <20 entries
** Created By Spark 2.4.4
** Type MANAGED
** Provider delta
** Location s3
** Serde Library org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
** InputFormat org.apache.hadoop.mapred.SequenceFileInputFormat
** OutputFormat org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
* SQL
** SELECT * FROM "${db.table.schema}"."${db.table.name}"
** output <20 entries
--
This message was sent by Atlassian Jira
(v8.3.4#803005)