You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Florian Baumert (JIRA)" <ji...@apache.org> on 2018/11/14 10:14:00 UTC

[jira] [Created] (BEAM-6061) When using unbounded PCollection from TextIO to BigQuery, data is stuck in Reshuffle/GroupByKey inside of BigQueryIO

Florian Baumert created BEAM-6061:
-------------------------------------

             Summary: When using unbounded PCollection from TextIO to BigQuery, data is stuck in Reshuffle/GroupByKey inside of BigQueryIO
                 Key: BEAM-6061
                 URL: https://issues.apache.org/jira/browse/BEAM-6061
             Project: Beam
          Issue Type: Bug
          Components: io-java-gcp
    Affects Versions: 2.8.0
            Reporter: Florian Baumert
            Assignee: Chamikara Jayalath


As a short summary, when reading from TextIO with watchForNewFiles (unbounded collection), BigQueryIO in stream mode, BigQueryIO does not write data. It is "stuck" in a GroupByKey internal to Reshuffle.

 

The issue is detailed in stackoverflow with code and version information.

[https://stackoverflow.com/questions/53266689/when-using-unbounded-pcollection-from-textio-to-bigquery-data-is-stuck-in-reshu]

 

 Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)