You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 12:54:00 UTC

[GitHub] [beam] damccorm opened a new issue, #19721: Add Schema support for Tensorflow

damccorm opened a new issue, #19721:
URL: https://github.com/apache/beam/issues/19721

   Similar to BEAM-5807, Tensorflow's defacto storage format is `TFRecord` files with `Example` proto payload and its own schema.proto. We already have `TFRecordIO` support. Need to implement:
    * Conversion between Beam and TF schema
    * Conversion between Beam Row and TF Example proto
    * `TFRecordTableProvider`
   
   [https://github.com/tensorflow/metadata/blob/master/tensorflow_metadata/proto/v0/schema.proto](https://github.com/tensorflow/metadata/blob/master/tensorflow_metadata/proto/v0/schema.proto)
   
   [https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/example/example.proto](https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/example/example.proto)
   
    
   
   Also it seems the metadata protos are not published as Java artifacts:
   
   [https://github.com/tensorflow/metadata/issues/5](https://github.com/tensorflow/metadata/issues/5)
   
   My WIP branch: https://github.com/spotify/beam/tree/neville/tf
   
   Imported from Jira [BEAM-7921](https://issues.apache.org/jira/browse/BEAM-7921). Original Jira may contain additional context.
   Reported by: sinisa_lyh.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org