You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 21:42:36 UTC

[GitHub] [beam] damccorm opened a new issue, #21161: Versions after `2.28.0` fail to infer grouping decoders after a date is selected from a data structure

damccorm opened a new issue, #21161:
URL: https://github.com/apache/beam/issues/21161

   The below code throws this type error on the effected versions, and merely works as expected on 2.28.0:
   
   `TypeError: Unable to deterministically encode '2021-11-02' of type '<class 'datetime.date'\>', please provide a type hint for the input of 'GroupByKey' [while running 'Create/Map(decode)']`
   
   ```
   
   import typing
   from datetime import date
   import apache_beam as beam
   from apache_beam.testing.test_pipeline
   import TestPipeline
   
   with TestPipeline() as pipeline:
   	today = date.today()
   	results = (
   		pipeline
   		|
   beam.Create([(1, { 'd': today }), (1, { 'd': today })])
   		| beam.MapTuple(lambda i, d: (d['d'], i))
   # <-- this step only requires output type hints on versions after 2.28.0, and only if the date is being
   "projected" from some other data structure
   		| beam.CombinePerKey(sum) # <-- if this aggregation is
   removed, the pipeline also works without error
   	)
   
   	results | beam.Map(print)
   
   ```
   
   
   This stackoverflow issue is having the same problem:
   https://stackoverflow.com/questions/69409693/how-do-i-use-a-datetime-date-value-in-apache-beam-groupby
   
   It's possible to fix the errors by registering a `DateCoder` and adding output type hints to the projection `MapTuple` step, but since this isn't necessary in other situations and versions, it seems this is a bug. Our production pipelines will need to add many of these tedious type hints in order to work properly, so we're effectively blocked from upgrading to the newest version.
   
   Imported from Jira [BEAM-13166](https://issues.apache.org/jira/browse/BEAM-13166). Original Jira may contain additional context.
   Reported by: blaine.hansen@marketdial.com.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org