Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/12/01 15:08:00 UTC

[jira] [Work logged] (BEAM-12865) Allow customising batch duration when streaming with WriteToBigQuery

     [ https://issues.apache.org/jira/browse/BEAM-12865?focusedWorklogId=688727&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688727 ]

ASF GitHub Bot logged work on BEAM-12865:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Dec/21 15:07
            Start Date: 01/Dec/21 15:07
    Worklog Time Spent: 10m 
      Work Description: quentin-sommer commented on pull request #15489:
URL: https://github.com/apache/beam/pull/15489#issuecomment-983733252


   I don't understand why the pre-commit script is failing; I can't tell which test the stack trace is coming from. I need help figuring it out. I'm still very interested in getting this merged.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 688727)
    Time Spent: 10h 10m  (was: 10h)

> Allow customising batch duration when streaming with WriteToBigQuery
> --------------------------------------------------------------------
>
>                 Key: BEAM-12865
>                 URL: https://issues.apache.org/jira/browse/BEAM-12865
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-py-gcp
>    Affects Versions: Not applicable
>            Reporter: Quentin Sommer
>            Priority: P2
>             Fix For: Not applicable
>
>          Time Spent: 10h 10m
>  Remaining Estimate: 0h
>
> Hi,
> We allow customising the {{batch_size}} when streaming to BigQuery, but the batch duration (used by {{GroupIntoBatches}}) is set to {{DEFAULT_BATCH_BUFFERING_DURATION_LIMIT_SEC}} (0.2 seconds).
> I'd like to add the option to specify the batch duration, to allow better batching in scenarios with low data throughput.
> It will use the {{triggering_frequency}} param already used when doing batch file loads.
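
To illustrate why the buffering duration matters at low throughput, here is a minimal pure-Python sketch. It is a simplified model of size-or-duration batching, not Beam's GroupIntoBatches implementation. The function name batch_by_size_or_duration and its parameters are hypothetical and chosen only for this example.

```python
from typing import Iterable, List, Tuple


def batch_by_size_or_duration(
    events: Iterable[Tuple[float, str]],
    batch_size: int,
    max_buffering_duration_secs: float,
) -> List[List[str]]:
    """Group (arrival_time_secs, value) pairs, sorted by time, into batches.

    Hypothetical helper: a batch is emitted when it reaches batch_size, or
    when the next element arrives after the batch's buffering deadline.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    deadline = 0.0
    for arrival, value in events:
        # The open batch has been buffered long enough: emit it first.
        if current and arrival >= deadline:
            batches.append(current)
            current = []
        # The first element of a new batch starts the buffering timer.
        if not current:
            deadline = arrival + max_buffering_duration_secs
        current.append(value)
        # Emit as soon as the size limit is reached.
        if len(current) >= batch_size:
            batches.append(current)
            current = []
    if current:
        batches.append(current)
    return batches


# Low throughput: one row every 0.5 seconds, ten rows in total.
events = [(i * 0.5, f"row-{i}") for i in range(10)]

# With a 0.2 s limit, every row ends up alone in its own batch.
short = batch_by_size_or_duration(events, batch_size=500,
                                  max_buffering_duration_secs=0.2)

# With a 2 s limit, rows are grouped into far fewer, larger batches.
longer = batch_by_size_or_duration(events, batch_size=500,
                                   max_buffering_duration_secs=2.0)

print(len(short), [len(b) for b in longer])
```

In this model the size limit of 500 is never reached, so the duration limit alone decides how many insert requests would be issued. That is the low-throughput situation the issue describes.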



--
This message was sent by Atlassian Jira
(v8.20.1#820001)