Posted to notifications@superset.apache.org by "reesercollins (via GitHub)" <gi...@apache.org> on 2023/09/06 17:08:15 UTC

[GitHub] [superset] reesercollins opened a new issue, #20232: [SIP-83] Proposal for copying virtual datasets

reesercollins opened a new issue, #20232:
URL: https://github.com/apache/superset/issues/20232

   ## [SIP-83] Proposal for copying virtual datasets
   
   ### Motivation
   
   There is currently no way for users to copy datasets in Superset. Although they can copy a dataset's SQL, changes made to field types (e.g. in cases where we turn long, complex types into JSON, or advanced types as described in #17852), to column labels, and so on cannot easily be copied.
   
   ### Proposed Change
   
   Add a duplicate action to virtual datasets within the dataset list:
   ![image](https://user-images.githubusercontent.com/10563996/171485930-09554974-8c77-4509-a18b-c6b6f03d7473.png)
   
   When clicked, it will open a modal which will allow the user to choose a name for the new dataset:
   ![image](https://user-images.githubusercontent.com/10563996/171486469-2cbe0a42-ca7c-4823-8f81-b0c605f9e805.png)
   
   When OK is clicked, a duplicate of the dataset is created with the chosen name.
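
As a rough illustration of what "duplicate" means here, the sketch below models a virtual dataset in memory and copies it under a new name, carrying the column type overrides and labels along with the SQL. The `Column` and `VirtualDataset` classes and the `duplicate_dataset` function are hypothetical names invented for this example; they are not Superset's models or API.

```python
from __future__ import annotations

import copy
from dataclasses import dataclass, field, replace


@dataclass
class Column:
    # Hypothetical stand-in for a dataset column's editable metadata.
    name: str
    type: str        # e.g. "JSON" after a manual type override
    label: str = ""  # human-readable column label


@dataclass
class VirtualDataset:
    # Hypothetical stand-in for a virtual (SQL-backed) dataset.
    name: str
    sql: str
    columns: list[Column] = field(default_factory=list)


def duplicate_dataset(source: VirtualDataset, new_name: str) -> VirtualDataset:
    """Return an independent copy of `source` under `new_name`."""
    if not new_name or new_name == source.name:
        raise ValueError("the duplicate needs a new, non-empty name")
    # Deep-copy the columns so later edits to the copy do not leak back
    # into the original dataset.
    return replace(source, name=new_name, columns=copy.deepcopy(source.columns))


original = VirtualDataset(
    name="events",
    sql="SELECT id, payload FROM raw_events",
    columns=[Column("id", "BIGINT", "Event ID"), Column("payload", "JSON", "Payload")],
)
clone = duplicate_dataset(original, "events_copy")
clone.columns[1].label = "Raw payload"  # only the copy changes
```

Copying only the SQL string would lose the `type` and `label` values, which is the gap the Motivation section describes.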
   
   Here it is in action:
   
   https://user-images.githubusercontent.com/10563996/171488107-d5377bee-4981-464b-b2f7-446a01ac4d54.mp4 
   
   See [this pull request](https://github.com/apache/superset/pull/20309) for a working demo.
   
   ### New dependencies
   
   None.
   
   ### Migration Plan and Compatibility
   
   No migration needed.
   
   ### Rejected Alternatives
   
   No alternatives have been considered at this time.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [superset] ktmud commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
ktmud commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1170358435

   I think it'd be valuable if there is a copy virtual dataset flow in Explore, too.


Re: [I] [SIP-83] Proposal for copying virtual datasets [superset]

Posted by "rusackas (via GitHub)" <gi...@apache.org>.
rusackas commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-2035009887

   This SIP is at risk of being considered discarded due to lack of activity, if nobody has plans to carry it forward in the near future. Please let us know. You can always revisit/reopen.


[GitHub] [superset] jess-dillard commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
jess-dillard commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1170431150

   Definitely agree with @zuzana-vej and would add only those with dataset edit rights should be able to duplicate to avoid the situation described.
   
   @yousoph Yes - once we combine we'll want that to be the functionality. So we can add it in now or address it once we get into those dataset CRUD changes.
   
   Small request @reesercollins - Can we change the primary button label from "Ok" to either "Duplicate" or "Create"? Typically we only use "Ok" as a button label if no action was taken in the modal. 


Re: [I] [SIP-83] Proposal for copying virtual datasets [superset]

Posted by "cccs-rc (via GitHub)" <gi...@apache.org>.
cccs-rc commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-2035199898

   Sorry for missing your previous comment, @rusackas! We seem to have gotten this merged back in August 2022. Here's the PR: https://github.com/apache/superset/pull/20309 
   
   There may be a couple bugs hiding in there that are in our backlog to fix, but the bulk of it seems to have been merged upstream. :) 


[GitHub] [superset] yousoph commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
yousoph commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1170346261

   > Thank you for the SIP! @yousoph. Do you have any concerns with having the copy dataset only for virtual datasets? I wonder how this will look after SIP68 work is complete where there will no longer be a column for virtual or physical?
   
   I think it's okay to have the copy only for virtual datasets (especially since there's a limitation on the physical dataset not being able to point to the same db table) but agreed that it might be more confusing later on if virtual/physical aren't displayed as a column in the list view. 
   
   @jess-dillard do you think physical datasets that can't be copied need some explanation, like a disabled duplicate icon with some tooltip text?


[GitHub] [superset] reesercollins commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
reesercollins commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1171187719

   > Definitely agree with @zuzana-vej and would add only those with dataset edit rights should be able to duplicate to avoid the situation described.
   
   Currently there is a permission `can_duplicate`. If someone is concerned with the described situation occurring, they can assign `can_duplicate` and `can_edit` together.
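
One way an administrator might audit that pairing is sketched below. The permission names `can_duplicate` and `can_edit` come from the comment above; the role names, the plain dictionary of roles, and the `roles_missing_edit` helper are hypothetical and do not reflect how Superset's security manager stores or checks permissions.

```python
from __future__ import annotations


def roles_missing_edit(roles: dict[str, set[str]]) -> list[str]:
    """Names of roles that grant can_duplicate without also granting can_edit."""
    return sorted(
        name
        for name, perms in roles.items()
        if "can_duplicate" in perms and "can_edit" not in perms
    )


# Hypothetical role definitions, for illustration only.
roles = {
    "editor": {"can_read", "can_edit", "can_duplicate"},
    "viewer": {"can_read"},
    "analyst": {"can_read", "can_duplicate"},
}

# Roles listed here would let a user copy a dataset they cannot edit.
flagged = roles_missing_edit(roles)
print(flagged)
```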


Re: [I] [SIP-83] Proposal for copying virtual datasets [superset]

Posted by "rusackas (via GitHub)" <gi...@apache.org>.
rusackas closed issue #20232: [SIP-83] Proposal for copying virtual datasets
URL: https://github.com/apache/superset/issues/20232


[GitHub] [superset] reesercollins closed issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
reesercollins closed issue #20232: [SIP-83] Proposal for copying virtual datasets
URL: https://github.com/apache/superset/issues/20232


Re: [I] [SIP-83] Proposal for copying virtual datasets [superset]

Posted by "rusackas (via GitHub)" <gi...@apache.org>.
rusackas commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1743783274

   Closing this as the vote [passed](https://lists.apache.org/thread/ztqdj2n3vtxrofzyq6zbpd2oxgfqlkby). @reesercollins @cccs-rc is there still intent (and a timeline) for getting this implemented?


[GitHub] [superset] reesercollins commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
reesercollins commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1170380069

   > At some point I think it would be good to add a clone dataset for physical dataset, but give the user the option to choose a new table for the dataset. It is quite common to have the exact same schema for datasets (eg, dev, test, prod data). So far I've been suggesting users use the export dataset feature to change the table and the UUID and reimport as a workaround.
   
   The duplication of datasets only occurs within a single instance of Superset, not across deployments. Export/Import is still the only way to move data between deployments.


[GitHub] [superset] nytai commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
nytai commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1170378775

   At some point I think it would be good to add a clone dataset for physical dataset, but give the user the option to choose a new table for the dataset. It is quite common to have the exact same schema for datasets (eg, dev, test, prod data). So far I've been suggesting users use the export dataset feature to change the table and the UUID and reimport as a workaround. 
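
The export/edit/reimport workaround described above might look roughly like the sketch below, which operates on an exported dataset definition that has already been loaded into a Python dictionary. The key names `table_name` and `uuid`, the sample contents, and the `retarget_export` helper are assumptions made for illustration; check an actual export file for the real layout before relying on them.

```python
from __future__ import annotations

import copy
import uuid


def retarget_export(exported: dict, new_table: str) -> dict:
    """Copy an exported dataset definition, pointing it at another table."""
    clone = copy.deepcopy(exported)
    clone["table_name"] = new_table   # assumed key for the target table
    clone["uuid"] = str(uuid.uuid4())  # fresh identity so the import creates a new dataset
    return clone


# Hypothetical exported definition, as it might look after parsing the file.
prod = {
    "table_name": "orders",
    "uuid": "11111111-1111-1111-1111-111111111111",
    "columns": [{"column_name": "amount", "verbose_name": "Order amount"}],
}
dev = retarget_export(prod, "orders_dev")
```

The column definitions are carried over unchanged, which is what makes this useful for keeping dev, test, and prod datasets with the same schema in sync.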


[GitHub] [superset] eschutho commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
eschutho commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1155892060

   Thank you for the SIP! @yousoph. Do you have any concerns with having the copy dataset only for virtual datasets? I wonder how this will look after SIP68 work is complete where there will no longer be a column for virtual or physical?


[GitHub] [superset] zuzana-vej commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
zuzana-vej commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1170364681

   Definitely +1 for NOT enabling this for physical datasets, only virtual. In terms of virtual datasets, one small concern I have is with the recent changes on ownership of datasets: we would love to prevent cases where users copy a dataset just because they can no longer edit it. Example: Jane D. built a dataset and left the company. John Doe wants to edit the dataset, but because he doesn't have access he just goes and copies it, and perhaps changes the chart to use the new dataset. The same scenario can happen in a simpler case when John just doesn't know he needs to ask Jane to make him a co-owner to get edit access. As a result, the old dataset is no longer used, although it's not deleted, and it can just cause clutter and confusion.
   
   What I am proposing is to make the UX clear in a way that users can decide whether they should edit or copy, and if the only thing preventing them from editing it is the fact that they are not co-owners, it should be clear to them how to get themselves on the co-owner list (e.g. ask the current owner or admins).


[GitHub] [superset] rusackas commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
rusackas commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1153430198

   @reesercollins Thanks for the proposal and the demo. Could you please post this link (or both links) to the dev@superset.apache.org email list for a proper discussion? If you need help navigating those waters, just say the word and I'll be happy to assist.


[GitHub] [superset] eschutho commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
eschutho commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1176467879

   > At some point I think it would be good to add a clone dataset for physical dataset, but give the user the option to choose a new table for the dataset. It is quite common to have the exact same schema for datasets (eg, dev, test, prod data). So far I've been suggesting users use the export dataset feature to change the table and the UUID and reimport as a workaround.
   
   I think @nytai's idea could be the solution for how to have a duplicate button for both dataset types once they are merged together when the SIP68 work is done.
   


[GitHub] [superset] mistercrunch commented on issue #20232: [SIP-83] Proposal for copying virtual datasets

Posted by GitBox <gi...@apache.org>.
mistercrunch commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-1159072828

   Seems like there's a use case for "sibling" tables too, where, say, you have the same table in prod/staging/dev and you want to sync the dataset definitions.


Re: [I] [SIP-83] Proposal for copying virtual datasets [superset]

Posted by "rusackas (via GitHub)" <gi...@apache.org>.
rusackas commented on issue #20232:
URL: https://github.com/apache/superset/issues/20232#issuecomment-2035387208

   Thanks for the reminder/clarification. 

