You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (Jira)" <ji...@apache.org> on 2019/09/18 23:11:00 UTC

[jira] [Commented] (SPARK-29156) Hive has appending data as part of cdc, In write mode we should be able to write only changes captured to teradata or datasource.

    [ https://issues.apache.org/jira/browse/SPARK-29156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932913#comment-16932913 ] 

Hyukjin Kwon commented on SPARK-29156:
--------------------------------------

Can you clarify it and show the reproducer please? I cannot fully understand what this JIRA means.

> Hive has appending data as part of cdc, In write mode we should be able to write only changes captured to teradata or datasource.
> ---------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-29156
>                 URL: https://issues.apache.org/jira/browse/SPARK-29156
>             Project: Spark
>          Issue Type: New Feature
>          Components: Tests
>    Affects Versions: 2.4.3
>         Environment: spark 2.3.2
> dataiku
> aws emr
>            Reporter: raju
>            Priority: Major
>              Labels: patch
>   Original Estimate: 336h
>  Remaining Estimate: 336h
>
> In general change data captures are appended to hive tables. We have scenario where connecting to teradata/ datasource. Only changes captured as updates should be able to write in data source. We are unable to do same by over write and append modes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org