Posted to issues@beam.apache.org by "Pablo Estrada (Jira)" <ji...@apache.org> on 2020/11/21 05:59:00 UTC

[jira] [Created] (BEAM-11317) SDF BoundedSource consumer is not able to split data from the runner

Pablo Estrada created BEAM-11317:
------------------------------------

             Summary: SDF BoundedSource consumer is not able to split data from the runner
                 Key: BEAM-11317
                 URL: https://issues.apache.org/jira/browse/BEAM-11317
             Project: Beam
          Issue Type: Improvement
          Components: io-py-common
            Reporter: Pablo Estrada
            Assignee: Pablo Estrada


This affects BigQuery jobs whose export produces only a small number of files and whose read stage also carries a large per-element overhead.

The runner cannot split the work within such a stage, so the stage executes slowly and the job ends up running for a very long time. We should look into this.
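
To make the impact concrete, here is a rough back-of-the-envelope sketch in plain Python. It is not Beam code, and the file sizes, per-element cost, and worker count are hypothetical numbers chosen only for illustration. It compares a stage where each exported file is an indivisible unit of work against one where the runner can rebalance elements evenly across workers.

```python
import heapq
import math


def stage_wall_time_ms(file_sizes, per_element_cost_ms, workers, can_split):
    """Rough wall-clock estimate for one read+process stage (illustrative model only)."""
    if can_split:
        # Work can be rebalanced, so each worker handles an even share of elements.
        return math.ceil(sum(file_sizes) / workers) * per_element_cost_ms
    # Without splitting, every file stays on a single worker. Assign the largest
    # files first to the least-loaded worker (greedy scheduling).
    loads = [0] * workers
    heapq.heapify(loads)
    for size in sorted(file_sizes, reverse=True):
        lightest = heapq.heappop(loads)
        heapq.heappush(loads, lightest + size)
    return max(loads) * per_element_cost_ms


# Hypothetical job: 3 exported files of 1M elements each, 10 ms per element, 100 workers.
files = [1_000_000, 1_000_000, 1_000_000]
unsplit = stage_wall_time_ms(files, 10, workers=100, can_split=False)
split = stage_wall_time_ms(files, 10, workers=100, can_split=True)
print(unsplit, split)
```

Under these assumed numbers only 3 of the 100 workers ever receive work when the stage cannot be split, so the stage is bounded by the largest file rather than by the total work divided across workers.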



--
This message was sent by Atlassian Jira
(v8.3.4#803005)