You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Sourabh Bajaj (JIRA)" <ji...@apache.org> on 2017/03/15 21:49:41 UTC

[jira] [Created] (BEAM-1731) RuntimeError when running wordcount with ValueProviders

Sourabh Bajaj created BEAM-1731:
-----------------------------------

             Summary: RuntimeError when running wordcount with ValueProviders
                 Key: BEAM-1731
                 URL: https://issues.apache.org/jira/browse/BEAM-1731
             Project: Beam
          Issue Type: Bug
          Components: sdk-py
            Reporter: Sourabh Bajaj
            Assignee: MarĂ­a GH


Running: python -m apache_beam.examples.wordcount

INFO:root:Job 2017-03-15_13_39_59-3092873759767386 is in state JOB_STATE_FAILED
Traceback (most recent call last):
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py", line 162, in _run_module_as_main
    "__main__", fname, loader, pkg_name)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py", line 72, in _run_code
    exec code in run_globals
  File "/Users/sourabhbajaj/Projects/incubator-beam/sdks/python/apache_beam/examples/wordcount.py", line 119, in <module>
    run()
  File "/Users/sourabhbajaj/Projects/incubator-beam/sdks/python/apache_beam/examples/wordcount.py", line 109, in run
    result.wait_until_finish()
  File "apache_beam/runners/dataflow/dataflow_runner.py", line 711, in wait_until_finish
    (self.state, getattr(self._runner, 'last_error_msg', None)), self)
apache_beam.runners.dataflow.dataflow_runner.DataflowRuntimeException: Dataflow pipeline failed. State: FAILED, Error:
(e22fabbb61bfae00): Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/dataflow_worker/batchworker.py", line 544, in do_work
    work_executor.execute()
  File "dataflow_worker/executor.py", line 1013, in dataflow_worker.executor.CustomSourceSplitExecutor.execute (dataflow_worker/executor.c:31501)
    self.response = self._perform_source_split_considering_api_limits(
  File "dataflow_worker/executor.py", line 1021, in dataflow_worker.executor.CustomSourceSplitExecutor._perform_source_split_considering_api_limits (dataflow_worker/executor.c:31703)
    split_response = self._perform_source_split(source_operation_split_task,
  File "dataflow_worker/executor.py", line 1059, in dataflow_worker.executor.CustomSourceSplitExecutor._perform_source_split (dataflow_worker/executor.c:32341)
    for split in source.split(desired_bundle_size):
  File "/usr/local/lib/python2.7/dist-packages/apache_beam/io/filebasedsource.py", line 192, in split
    return self._get_concat_source().split(
  File "/usr/local/lib/python2.7/dist-packages/apache_beam/utils/value_provider.py", line 105, in _f
    raise RuntimeError('%s not accessible' % obj)
RuntimeError: RuntimeValueProvider(option: input, type: str, default_value: 'gs://dataflow-samples/shakespeare/kinglear.txt') not accessible




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)