Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2021/05/11 21:57:00 UTC
[jira] [Updated] (HUDI-1743) Add support for Spark SQL File based transformer for deltastreamer
[ https://issues.apache.org/jira/browse/HUDI-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
sivabalan narayanan updated HUDI-1743:
--------------------------------------
Labels: features pull-request-available sev:normal (was: features pull-request-available sev:nor)
> Add support for Spark SQL File based transformer for deltastreamer
> ------------------------------------------------------------------
>
> Key: HUDI-1743
> URL: https://issues.apache.org/jira/browse/HUDI-1743
> Project: Apache Hudi
> Issue Type: Improvement
> Components: DeltaStreamer
> Reporter: Vinoth Govindarajan
> Assignee: Vinoth Govindarajan
> Priority: Minor
> Labels: features, pull-request-available, sev:normal
>
> The current SQLQuery based transformer is limited in functionality: you can't pass multiple Spark SQL statements separated by semicolons, which is necessary when the transformation is complex.
>
> The ask is to add a new SQLFileBasedTransformer that takes a Spark SQL file containing multiple Spark SQL statements as input and applies the transformation to the delta streamer payload.
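
To illustrate the request, here is a minimal sketch of the core idea: split the contents of a SQL file into individual statements and run them in order. It is written in Python and is not the Hudi implementation. The names `split_sql_statements`, `run_statements`, and the `execute` callback are hypothetical; in a Spark job the callback would presumably wrap a call such as `spark.sql`. The splitter only ignores semicolons inside single-quoted literals and does not handle SQL comments or backslash escapes.

```python
def split_sql_statements(sql_text):
    """Split SQL text on semicolons that sit outside single-quoted literals.

    Hypothetical helper for illustration only; comments and backslash
    escapes are not handled.
    """
    statements = []
    current = []
    in_quote = False
    for ch in sql_text:
        if ch == "'":
            # Toggle on every quote; a doubled '' toggles twice and nets out.
            in_quote = not in_quote
        if ch == ";" and not in_quote:
            stmt = "".join(current).strip()
            if stmt:  # skip empty statements such as ";;"
                statements.append(stmt)
            current = []
        else:
            current.append(ch)
    tail = "".join(current).strip()
    if tail:  # last statement may lack a trailing semicolon
        statements.append(tail)
    return statements


def run_statements(sql_text, execute):
    """Run each statement in order and return the result of the last one.

    `execute` is an assumed callback that takes one SQL string.
    """
    result = None
    for stmt in split_sql_statements(sql_text):
        result = execute(stmt)
    return result
```

Returning the result of the last statement reflects one plausible design, in which earlier statements set up intermediate views and the final statement produces the transformed dataset.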
--
This message was sent by Atlassian Jira
(v8.3.4#803005)