You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Ian Hellstrom (JIRA)" <ji...@apache.org> on 2016/06/02 06:07:59 UTC
[jira] [Created] (SPARK-15727) Add UPSERT/MERGE mode to
DataFrameWriter
Ian Hellstrom created SPARK-15727:
-------------------------------------
Summary: Add UPSERT/MERGE mode to DataFrameWriter
Key: SPARK-15727
URL: https://issues.apache.org/jira/browse/SPARK-15727
Project: Spark
Issue Type: Wish
Components: SQL
Reporter: Ian Hellstrom
In many cases simply overwriting or appending data is not enough. Instead data should be updated (if already present) and inserted if not. This capability would make the DataFrameWriter an all-purpose storing-to-disk mechanism. RDBMSs often support MERGE or UPSERT, so that we could technically piggyback on that. For databases that do not have MERGE or UPSERT, we could disallow that feature.
Ideally this feature is an additional 'mode' to make it easier for the user.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org