You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Brian Hulette (Jira)" <ji...@apache.org> on 2022/03/30 19:23:00 UTC
[jira] [Commented] (BEAM-14199) Expose dataframes/pandas top-level functions.
[ https://issues.apache.org/jira/browse/BEAM-14199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514906#comment-17514906 ]
Brian Hulette commented on BEAM-14199:
--------------------------------------
Thanks we can definitely expose this in a more useful way. As with most things CS the hard part is naming...
Should we just put these functions (along with other utilities like to_dataframe, to_pcollection) in the top-level dataframe? Then users could do
{code}
from apache_beam import dataframe as beam_pd
{code}
> Expose dataframes/pandas top-level functions.
> ---------------------------------------------
>
> Key: BEAM-14199
> URL: https://issues.apache.org/jira/browse/BEAM-14199
> Project: Beam
> Issue Type: Improvement
> Components: sdk-py-core
> Reporter: Robert Bradshaw
> Assignee: Brian Hulette
> Priority: P2
>
> Right now a user needs to do something like
> from apache_beam.dataframe.pandas_top_level_functions import pd_wrapper as beam_pd
> to use top-level functions like merge, which is undocumented and unintuitive.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)