Posted to dev@sqoop.apache.org by "Gwen Shapira (JIRA)" <ji...@apache.org> on 2014/11/02 20:29:33 UTC

[jira] [Created] (SQOOP-1656) SQOOP2: Support for "merge" with different changing dimension types

Gwen Shapira created SQOOP-1656:
-----------------------------------

             Summary: SQOOP2: Support for "merge" with different changing dimension types
                 Key: SQOOP-1656
                 URL: https://issues.apache.org/jira/browse/SQOOP-1656
             Project: Sqoop
          Issue Type: Wish
            Reporter: Gwen Shapira


Our current "incremental" design is for "append" only.

However, we do plan on adding "merge" capabilities sometime in the future. Maybe.

Sqoop1 merges by overwriting existing rows with their newer versions.
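
To make the overwrite behavior concrete, here is a minimal sketch in plain Python. It is not Sqoop code: the function name merge_overwrite, the "id" key column, and the sample rows are all made up for illustration.

```python
def merge_overwrite(existing, incoming, key="id"):
    """Overwrite-style merge: a newer row replaces the older row with the same key.

    Hypothetical helper for illustration only; not part of any Sqoop API.
    """
    # Index the existing rows by key (dicts keep insertion order).
    merged = {row[key]: row for row in existing}
    for row in incoming:
        # Same key: the old version is lost. New key: the row is added.
        merged[row[key]] = row
    return list(merged.values())


existing = [{"id": 1, "city": "Boston"}, {"id": 2, "city": "Austin"}]
incoming = [{"id": 2, "city": "Denver"}, {"id": 3, "city": "Miami"}]
result = merge_overwrite(existing, incoming)
```

After the merge, row 2 holds only "Denver"; nothing records that it used to be "Austin".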

But for DWH dimensions, there are other ways to merge:
http://en.wikipedia.org/wiki/Slowly_changing_dimension

For example, preserving both versions and adding "start date" and "end date" columns to each (Type 2 in the article above).
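
A rough sketch of what such a history-preserving merge could look like, again in plain Python rather than anything Sqoop provides. The names merge_scd2, start_date, end_date, and load_date are assumptions, as is the convention that end_date of None marks the current version of a row.

```python
from datetime import date


def merge_scd2(dimension, incoming, load_date, key="id"):
    """History-preserving merge: close the old version, append the new one.

    Hypothetical helper for illustration only; not part of any Sqoop API.
    Assumes end_date is None on the current version of each row.
    """
    # Copy the rows so the caller's dimension table is not modified.
    result = [dict(r) for r in dimension]
    # Index the currently open version of each key.
    open_rows = {r[key]: r for r in result if r["end_date"] is None}
    for row in incoming:
        old = open_rows.get(row[key])
        if old is not None:
            old_attrs = {k: v for k, v in old.items()
                         if k not in ("start_date", "end_date")}
            if old_attrs == row:
                continue  # nothing changed, keep the open version as is
            old["end_date"] = load_date  # close the previous version
        new = dict(row, start_date=load_date, end_date=None)
        result.append(new)
        open_rows[row[key]] = new
    return result


dimension = [
    {"id": 1, "city": "Boston", "start_date": date(2014, 1, 1), "end_date": None},
]
incoming = [{"id": 1, "city": "Denver"}, {"id": 2, "city": "Miami"}]
history = merge_scd2(dimension, incoming, load_date=date(2014, 11, 2))
```

Here the "Boston" row is kept with its end date set to the load date, and a new open "Denver" row is added next to it, so both versions stay queryable.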

ETL tools can handle these situations. It would be cool if Sqoop2 could do it too.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)