Posted to issues@beam.apache.org by "Gaurav Rawat (Jira)" <ji...@apache.org> on 2021/04/12 17:49:00 UTC

[jira] [Commented] (BEAM-11037) Python DirectRunner InputStream Issues (RuntimeError: VarLong too long)

    [ https://issues.apache.org/jira/browse/BEAM-11037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17319624#comment-17319624 ] 

Gaurav Rawat commented on BEAM-11037:
-------------------------------------

I'm stuck with the same issue in my environment as well (Windows 10, Python 3.7). Is there any solution or workaround for this? [~kenn] [~Stauffer]

> Python DirectRunner InputStream Issues (RuntimeError: VarLong too long)
> -----------------------------------------------------------------------
>
>                 Key: BEAM-11037
>                 URL: https://issues.apache.org/jira/browse/BEAM-11037
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-direct, sdk-py-core
>    Affects Versions: 2.24.0
>         Environment: Windows 10 64bit, Python 3.7.9
>            Reporter: Michael Stauffer
>            Priority: P2
>
> I have a Beam workflow that runs fine on Dataflow. However, when I run it on my local direct runner (Windows 10, Python SDK), I run into some strange behaviour if I have a negative number in my PCollection (see below for an example).
> Minimal code example to reproduce the exception:
> {code:java}
> from apache_beam.options.pipeline_options import PipelineOptions
> from apache_beam.pipeline import Pipeline
> import apache_beam
> if __name__ == '__main__':
>     pipeline_options = PipelineOptions()
>     with Pipeline(options=pipeline_options) as p:
>         lines = (
>                 p
>                 | apache_beam.Create([
>                     {"Test": -1}
>                 ]))
>         lines | apache_beam.io.WriteToText('./lines')
> {code}
> Exception:
> {code:java}
>   File "apache_beam\coders\coder_impl.py", line 222, in apache_beam.coders.coder_impl.StreamCoderImpl.decode
>   File "apache_beam\coders\coder_impl.py", line 224, in apache_beam.coders.coder_impl.StreamCoderImpl.decode
>   File "apache_beam\coders\coder_impl.py", line 470, in apache_beam.coders.coder_impl.FastPrimitivesCoderImpl.decode_from_stream
>   File "apache_beam\coders\coder_impl.py", line 450, in apache_beam.coders.coder_impl.FastPrimitivesCoderImpl.decode_from_stream
>   File "apache_beam\coders\stream.pyx", line 193, in apache_beam.coders.stream.InputStream.read_var_int64
> RuntimeError: VarLong too long. [while running 'Create/Map(decode)']
> {code}
>  Any help is highly appreciated.
>  
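
For readers unfamiliar with the error message, here is a minimal sketch of variable-length integer encoding. It assumes the common base-128 varint scheme (7 payload bits per byte, high bit set while more bytes follow). It is not Beam's code from stream.pyx; the function names and the `max_shift` parameter are hypothetical and exist only for illustration. It shows that -1, viewed as an unsigned 64-bit value, needs the maximum of 10 bytes, so a decoder whose length check is narrower than 64 bits would reject it while small positive values still pass. Whether that is the root cause on Windows is not confirmed in this thread.

```python
MASK64 = (1 << 64) - 1  # two's-complement view of a signed 64-bit integer


def encode_var_int64(value):
    """Encode a signed 64-bit integer as a base-128 varint (illustrative only)."""
    bits = value & MASK64
    out = bytearray()
    while True:
        byte = bits & 0x7F
        bits >>= 7
        if bits:
            out.append(byte | 0x80)  # more bytes follow
        else:
            out.append(byte)  # last byte, continuation bit clear
            return bytes(out)


def decode_var_int64(data, max_shift=64):
    """Decode a varint; max_shift is a hypothetical knob for the length check."""
    result = 0
    shift = 0
    for byte in data:
        if shift >= max_shift:
            raise RuntimeError("VarLong too long.")
        result |= (byte & 0x7F) << shift
        shift += 7
        if not byte & 0x80:
            break
    result &= MASK64
    if result >= 1 << 63:  # reinterpret as signed
        result -= 1 << 64
    return result


if __name__ == "__main__":
    encoded = encode_var_int64(-1)
    print(len(encoded))               # number of bytes used for -1
    print(decode_var_int64(encoded))  # round trip with a full 64-bit limit
    try:
        decode_var_int64(encoded, max_shift=32)  # simulated narrower limit
    except RuntimeError as exc:
        print(exc)
```

With the full 64-bit limit the round trip succeeds; with the simulated 32-bit limit the same bytes raise the error, while a small positive value such as 1 decodes under either limit.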



--
This message was sent by Atlassian Jira
(v8.3.4#803005)