Posted to issues@spark.apache.org by "Ian Hellstrom (JIRA)" <ji...@apache.org> on 2016/06/02 06:07:59 UTC

[jira] [Created] (SPARK-15727) Add UPSERT/MERGE mode to DataFrameWriter

Ian Hellstrom created SPARK-15727:
-------------------------------------

             Summary: Add UPSERT/MERGE mode to DataFrameWriter
                 Key: SPARK-15727
                 URL: https://issues.apache.org/jira/browse/SPARK-15727
             Project: Spark
          Issue Type: Wish
          Components: SQL
            Reporter: Ian Hellstrom


In many cases, simply overwriting or appending data is not enough. Instead, rows that are already present should be updated, and rows that are not should be inserted. This capability would make the DataFrameWriter an all-purpose mechanism for persisting data. Many RDBMSs support MERGE or UPSERT, so the writer could delegate to the database's native statement. For databases that support neither MERGE nor UPSERT, the feature could simply be disallowed.

Ideally, this feature would be exposed as an additional 'mode', which would make it easier for the user.
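
To make the requested semantics concrete, here is a minimal sketch in plain Python that models how an 'upsert' mode would differ from overwriting and appending. It is not Spark code: the function write_rows, the mode string "upsert", and the key parameter are hypothetical names chosen for illustration. The sketch also assumes that each row is identified by a single unique key column and that an update replaces the whole row.

```python
from typing import Dict, Iterable, List

Row = Dict[str, object]


def write_rows(existing: List[Row], incoming: Iterable[Row],
               mode: str, key: str) -> List[Row]:
    """Return the table contents after writing `incoming` in the given mode.

    Hypothetical model only: "overwrite" and "append" mimic the existing
    ideas of replacing or extending the target, while "upsert" is the
    behaviour proposed in this issue.
    """
    incoming = list(incoming)
    if mode == "overwrite":
        # Discard whatever was stored before.
        return incoming
    if mode == "append":
        # Keep everything, even rows whose key already exists.
        return existing + incoming
    if mode == "upsert":
        # Index the stored rows by key (assumes keys are unique), then let
        # each incoming row replace its match or be added as a new row.
        merged = {row[key]: row for row in existing}
        for row in incoming:
            merged[row[key]] = row
        return list(merged.values())
    raise ValueError(f"unsupported mode: {mode}")


stored = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}]
batch = [{"id": 2, "name": "B"}, {"id": 3, "name": "c"}]

appended = write_rows(stored, batch, mode="append", key="id")
upserted = write_rows(stored, batch, mode="upsert", key="id")
```

With append, the row with id 2 ends up stored twice. With the hypothetical upsert mode, it is updated in place and only the row with id 3 is added. A real implementation backed by an RDBMS would presumably hand this matching off to the database's own MERGE or UPSERT statement rather than doing it in memory.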



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
