You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/11/10 11:48:00 UTC

[jira] [Work logged] (BEAM-9804) beam.io.BigQuerySource needs permissions to create datasets to be able to run queries

     [ https://issues.apache.org/jira/browse/BEAM-9804?focusedWorklogId=509649&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-509649 ]

ASF GitHub Bot logged work on BEAM-9804:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Nov/20 11:47
            Start Date: 10/Nov/20 11:47
    Worklog Time Spent: 10m 
      Work Description: frankzhao commented on pull request #12960:
URL: https://github.com/apache/beam/pull/12960#issuecomment-724651451


   Refactored to have the `dataset_id` in `BigQueryWrapper` and squashed the commits. Let me know what you think @pabloem 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 509649)
    Time Spent: 2.5h  (was: 2h 20m)

> beam.io.BigQuerySource needs permissions to create datasets to be able to run queries
> -------------------------------------------------------------------------------------
>
>                 Key: BEAM-9804
>                 URL: https://issues.apache.org/jira/browse/BEAM-9804
>             Project: Beam
>          Issue Type: Bug
>          Components: io-py-gcp
>            Reporter: Jonathan Sulman
>            Priority: P3
>          Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> Based on BEAM-8458, which was closed with a Java fix in 2.20.0. However, the bug still exists in the python SDK.
> When using BigQuerySource with the query option, BigQueryReader creates a temporary dataset to store the results of the query.
> Therefore, Beam requires permissions to create datasets just to be able to run a query. In practice, this means that Beam requires the role bigQuery.User just to run queries, whereas if you use {{from}} (to read from a table), the role bigQuery.jobUser suffices.
> BigqueryDataSource should have an option to set an existing dataset  to write the temp results of
>  a query, so it would be enough with having the role bigQuery.jobUser.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)