You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 21:40:26 UTC

[GitHub] [beam] damccorm opened a new issue, #21131: Update to_dataframe API Docs to focus on schema use

damccorm opened a new issue, #21131:
URL: https://github.com/apache/beam/issues/21131

   The API documentation for to_dataframe (https://beam.apache.org/releases/pydoc/2.32.0/apache_beam.dataframe.convert.html#apache_beam.dataframe.convert.to_dataframe) is very sparse. It also focuses on specifying a proxy, rather than relying a schema-aware PCollection as an input. This function is often people's entrypoint into the API, so we should make it very clear how to use it. Let's expand the documentation, and focus on using schemas rather than specifying a proxy.
   
   We should also link to the documentation for to_dataframe in https://beam.apache.org/documentation/dsls/dataframes/overview/#embedding-dataframes-in-a-pipeline
   
   Imported from Jira [BEAM-13002](https://issues.apache.org/jira/browse/BEAM-13002). Original Jira may contain additional context.
   Reported by: bhulette.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [beam] TheNeuralBit commented on issue #21131: Update to_dataframe API Docs to focus on schema use

Posted by GitBox <gi...@apache.org>.
TheNeuralBit commented on issue #21131:
URL: https://github.com/apache/beam/issues/21131#issuecomment-1189225795

   See https://stackoverflow.com/questions/72996664 for an example of confusion this causes. proxy argument is being used rather than declaring a schema.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org