Posted to issues@beam.apache.org by "Corvin Deboeser (Jira)" <ji...@apache.org> on 2020/05/12 07:00:00 UTC

[jira] [Updated] (BEAM-9960) Python MongoDBIO fails when response of split vector command is larger than 16mb

     [ https://issues.apache.org/jira/browse/BEAM-9960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Corvin Deboeser updated BEAM-9960:
----------------------------------
    Description: 
When MongoDBIO reads a large collection and the source bundle size is determined to be 1, the response from the splitVector command can be larger than 16 MB, which pymongo / MongoDB does not support:

{{pymongo.errors.ProtocolError: Message length (33699186) is larger than server max message size (33554432)}}

 

Environment: Ran on Google Dataflow with Beam Python SDK 2.20.

  was:
When MongoDBIO reads a large collection and the source bundle size is determined to be 1, the response from the splitVector command can be larger than 16 MB, which pymongo / MongoDB does not support:

{{pymongo.errors.ProtocolError: Message length (33699186) is larger than server max message size (33554432)}}

> Python MongoDBIO fails when response of split vector command is larger than 16mb
> --------------------------------------------------------------------------------
>
>                 Key: BEAM-9960
>                 URL: https://issues.apache.org/jira/browse/BEAM-9960
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-core
>    Affects Versions: 2.20.0
>            Reporter: Corvin Deboeser
>            Priority: Major
>
> When MongoDBIO reads a large collection and the source bundle size is determined to be 1, the response from the splitVector command can be larger than 16 MB, which pymongo / MongoDB does not support:
> {{pymongo.errors.ProtocolError: Message length (33699186) is larger than server max message size (33554432)}}
>  
> Environment: Ran on Google Dataflow with Beam Python SDK 2.20.
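
To illustrate why a bundle size of 1 can produce such a large response, here is a rough back-of-the-envelope sketch. It is not taken from the Beam or pymongo source. It assumes the bundle size is in MB, that the number of split keys is roughly the collection size divided by the chunk size, and that each returned key costs about 32 bytes on the wire; the function names and the 32-byte figure are hypothetical. Only the 33554432-byte limit comes from the error message above.

```python
# Limit reported by the server in the ProtocolError above.
MAX_MESSAGE_BYTES = 33554432

# Assumption for illustration only: approximate wire size of one split key.
ASSUMED_BYTES_PER_KEY = 32


def estimated_split_keys(collection_bytes, chunk_mb):
    """Rough number of split points: one per boundary between chunks."""
    chunk_bytes = chunk_mb * 1024 * 1024
    chunks = -(-collection_bytes // chunk_bytes)  # integer ceiling division
    return max(chunks - 1, 0)


def estimated_response_bytes(collection_bytes, chunk_mb,
                             bytes_per_key=ASSUMED_BYTES_PER_KEY):
    """Rough size of a response that carries every split key at once."""
    return estimated_split_keys(collection_bytes, chunk_mb) * bytes_per_key


def min_safe_chunk_mb(collection_bytes,
                      bytes_per_key=ASSUMED_BYTES_PER_KEY,
                      limit=MAX_MESSAGE_BYTES):
    """Smallest power-of-two chunk size (MB) whose estimate fits the limit."""
    chunk_mb = 1
    while estimated_response_bytes(
            collection_bytes, chunk_mb, bytes_per_key) > limit:
        chunk_mb *= 2
    return chunk_mb
```

Under these assumptions, a 2 TiB collection split into 1 MB chunks yields about two million split keys, and the estimated response exceeds the limit; doubling the chunk size to 2 MB brings the estimate back under it. A possible mitigation along these lines would be to enforce a larger minimum chunk size, or to request split keys over several smaller key ranges, though whether either suits MongoDBIO is not established here.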



--
This message was sent by Atlassian Jira
(v8.3.4#803005)