You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Brian Hulette (Jira)" <ji...@apache.org> on 2022/03/30 19:23:00 UTC

[jira] [Commented] (BEAM-14199) Expose dataframes/pandas top-level functions.

    [ https://issues.apache.org/jira/browse/BEAM-14199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514906#comment-17514906 ] 

Brian Hulette commented on BEAM-14199:
--------------------------------------

Thanks we can definitely expose this in a more useful way. As with most things CS the hard part is naming...

Should we just put these functions (along with other utilities like to_dataframe, to_pcollection) in the top-level dataframe? Then users could do

{code}
from apache_beam import dataframe as beam_pd
{code}

> Expose dataframes/pandas top-level functions.
> ---------------------------------------------
>
>                 Key: BEAM-14199
>                 URL: https://issues.apache.org/jira/browse/BEAM-14199
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-py-core
>            Reporter: Robert Bradshaw
>            Assignee: Brian Hulette
>            Priority: P2
>
> Right now a user needs to do something like
> from apache_beam.dataframe.pandas_top_level_functions import pd_wrapper as beam_pd
> to use top-level functions like merge, which is undocumented and unintuitive. 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)