Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/10/01 00:04:00 UTC

[jira] [Work logged] (BEAM-10962) Multiple PubSub Source Reader

     [ https://issues.apache.org/jira/browse/BEAM-10962?focusedWorklogId=493237&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-493237 ]

ASF GitHub Bot logged work on BEAM-10962:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Oct/20 00:03
            Start Date: 01/Oct/20 00:03
    Worklog Time Spent: 10m 
      Work Description: boyuanzz edited a comment on pull request #12930:
URL: https://github.com/apache/beam/pull/12930#issuecomment-701708484


   Besides what Cham has mentioned, another issue is that the current implementation of `MultipleReadFromPubSub` can only configure multiple `ReadFromPubSub` transforms with the same attributes, such as the same `with_attributes`, `timestamp_label`, and `id_label`, which is not ideal. Given that `ReadPubSub` is a native transform for Dataflow, having `MultipleReadFromPubSub` seems like the only solution for now. I'm thinking we could create a `PubSubSourceDescriptor` that includes `topic`, `subscription`, and the other attributes, and expose an `add` API on `MultipleReadFromPubSub` so that end users can add a new read.
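
   A rough sketch of the descriptor idea, to make the proposal concrete. This is not the PR's code: the field names simply mirror the attributes named in the comment, the class `MultipleReadFromPubSubSketch` is a hypothetical stand-in that only collects descriptors (it is not a real `PTransform`), the "exactly one of topic or subscription" check is an assumption, and the project, topic, and subscription paths are placeholders.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class PubSubSourceDescriptor:
    """Hypothetical per-source configuration; field names follow the comment."""

    topic: Optional[str] = None
    subscription: Optional[str] = None
    with_attributes: bool = False
    timestamp_label: Optional[str] = None
    id_label: Optional[str] = None

    def __post_init__(self) -> None:
        # Assumption: each source names exactly one of topic or subscription.
        if (self.topic is None) == (self.subscription is None):
            raise ValueError("Specify exactly one of topic or subscription.")


class MultipleReadFromPubSubSketch:
    """Stand-in that only collects descriptors; not a real PTransform."""

    def __init__(self) -> None:
        self._sources: List[PubSubSourceDescriptor] = []

    def add(self, descriptor: PubSubSourceDescriptor) -> "MultipleReadFromPubSubSketch":
        # Each added source keeps its own settings, independent of the others.
        self._sources.append(descriptor)
        return self  # returning self allows chained add() calls

    @property
    def sources(self) -> Tuple[PubSubSourceDescriptor, ...]:
        return tuple(self._sources)


# Placeholder resource paths, for illustration only.
reads = (
    MultipleReadFromPubSubSketch()
    .add(PubSubSourceDescriptor(
        topic="projects/my-project/topics/events",
        with_attributes=True))
    .add(PubSubSourceDescriptor(
        subscription="projects/my-project/subscriptions/audit",
        id_label="message_id"))
)
```

   In an actual transform, the expansion step would presumably apply one read per descriptor and flatten the results; that part is left out here because it depends on how the PR wires up `ReadFromPubSub`.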




Issue Time Tracking
-------------------

    Worklog Id:     (was: 493237)
    Time Spent: 2h 40m  (was: 2.5h)

> Multiple PubSub Source Reader 
> ------------------------------
>
>                 Key: BEAM-10962
>                 URL: https://issues.apache.org/jira/browse/BEAM-10962
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-py-gcp
>            Reporter: Inigo San Jose Visiers
>            Priority: P2
>          Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> A very common use case in Dataflow / Beam is reading from multiple PubSub topics/subscriptions and flattening them.
> It would be nice to have a PTransform that does this without users having to build it themselves.


