Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 14:38:04 UTC

[GitHub] [beam] damccorm opened a new issue, #19973: Move Chicago Taxi Example to Python 3

damccorm opened a new issue, #19973:
URL: https://github.com/apache/beam/issues/19973

   The Chicago Taxi Example[1] should be moved to the latest version of Python supported by Beam (currently it's Python 3.7).
   
   At the moment, the following error occurs when running the benchmark on Python 3.7 (requires further investigation):
   ```
   
   Traceback (most recent call last):
     File "preprocess.py", line 259, in <module>
       main()
     File "preprocess.py", line 254, in main
       project=known_args.metric_reporting_project
     File "preprocess.py", line 155, in transform_data
       ('Analyze' >> tft_beam.AnalyzeDataset(preprocessing_fn)))
     File "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/transforms/ptransform.py", line 987, in __ror__
       return self.transform.__ror__(pvalueish, self.label)
     File "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/transforms/ptransform.py", line 547, in __ror__
       result = p.apply(self, pvalueish, label)
     File "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/pipeline.py", line 532, in apply
       return self.apply(transform, pvalueish)
     File "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/pipeline.py", line 573, in apply
       pvalueish_result = self.runner.apply(transform, pvalueish, self._options)
     File "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/runners/runner.py", line 193, in apply
       return m(transform, input, options)
     File "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/runners/runner.py", line 223, in apply_PTransform
       return transform.expand(input)
     File "/Users/kamilwasilewski/proj/beam/build/gradleenv/2022703441/lib/python3.7/site-packages/tensorflow_transform/beam/impl.py", line 825, in expand
       input_metadata))
     File "/Users/kamilwasilewski/proj/beam/build/gradleenv/2022703441/lib/python3.7/site-packages/tensorflow_transform/beam/impl.py", line 716, in expand
       output_signature = self._preprocessing_fn(copied_inputs)
     File "preprocess.py", line 102, in preprocessing_fn
       _fill_in_missing(inputs[key]),
   KeyError: 'company'
   
   ```
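
One way to narrow down the `KeyError` might be to compare the feature keys that `preprocessing_fn` looks up against the keys actually present in `inputs` before indexing. The following is only a minimal sketch, assuming `inputs` behaves like a dict; the helper `find_missing_keys` and every sample key other than `company` are hypothetical and not taken from the benchmark code.

```python
def find_missing_keys(inputs, expected_keys):
    """Return the expected keys that are absent from `inputs`, in order."""
    return [key for key in expected_keys if key not in inputs]


# Hypothetical stand-ins for the real feature list and the `inputs` dict
# passed to preprocessing_fn; only 'company' comes from the traceback.
expected_keys = ["company", "payment_type", "fare"]
inputs = {"payment_type": ["Cash"], "fare": [12.5]}

missing = find_missing_keys(inputs, expected_keys)
if missing:
    # Reporting every absent key at once avoids fixing them one KeyError at a time.
    print("Missing feature keys:", missing)
```

A check like this, run just before the lookup at line 102 of `preprocess.py`, could show whether `company` is the only absent feature or one of several.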
   
   [1] sdks/python/apache_beam/testing/benchmarks/chicago_taxi
   
   Imported from Jira [BEAM-9154](https://issues.apache.org/jira/browse/BEAM-9154). Original Jira may contain additional context.
   Reported by: kamilwu.

