You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/03 23:47:57 UTC

[GitHub] [beam] kennknowles opened a new issue, #19345: Python FileBasedCacheManager does not respect PCoder for PCollection being cached

kennknowles opened a new issue, #19345:
URL: https://github.com/apache/beam/issues/19345

   FileBasedCacheManager used by Python's InteractiveRunner does not preserve PCoder for elements of a PCollection being cached on disk. I suggest that the cache on-disk format to be changed to TFRecords (which are supported by Beam) and FileBasedCacheManager would store the desired PCoder for cached collections.
   Currently, it is not possible to work with dynamically-generated protocol buffer messages in interactive runner mode because of pickling errors.
   
   Imported from Jira [BEAM-6356](https://issues.apache.org/jira/browse/BEAM-6356). Original Jira may contain additional context.
   Reported by: leontyev_google.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org