You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC

[jira] [Updated] (SPARK-15727) Add UPSERT/MERGE mode to DataFrameWriter

     [ https://issues.apache.org/jira/browse/SPARK-15727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon updated SPARK-15727:
---------------------------------
    Labels: bulk-closed  (was: )

> Add UPSERT/MERGE mode to DataFrameWriter
> ----------------------------------------
>
>                 Key: SPARK-15727
>                 URL: https://issues.apache.org/jira/browse/SPARK-15727
>             Project: Spark
>          Issue Type: Wish
>          Components: SQL
>            Reporter: Ian Hellstrom
>            Priority: Major
>              Labels: bulk-closed
>
> In many cases simply overwriting or appending data is not enough. Instead data should be updated (if already present) and inserted if not. This capability would make the DataFrameWriter an all-purpose storing-to-disk mechanism. RDBMSs often support MERGE or UPSERT, so that we could technically piggyback on that. For databases that do not have MERGE or UPSERT, we could disallow that feature.
> Ideally this feature is an additional 'mode' to make it easier for the user.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org