Posted to issues@beam.apache.org by "Brian Hulette (Jira)" <ji...@apache.org> on 2021/01/12 23:36:00 UTC

[jira] [Updated] (BEAM-11627) Properly support `convert_dtype=True` in Series.apply

     [ https://issues.apache.org/jira/browse/BEAM-11627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brian Hulette updated BEAM-11627:
---------------------------------
    Status: Open  (was: Triage Needed)

> Properly support `convert_dtype=True` in Series.apply
> -----------------------------------------------------
>
>                 Key: BEAM-11627
>                 URL: https://issues.apache.org/jira/browse/BEAM-11627
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-py-core
>            Reporter: Brian Hulette
>            Priority: P2
>              Labels: dataframe-api
>
> See https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.apply.html
> `convert_dtype=True` tells pandas to inspect the output of the applied function and, where possible, set the result dtype to something more specific than object. We should intercept this argument and use type inference to set the dtype ourselves. We can't rely on pandas' inference because our implementation never observes the entire dataset.
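
A minimal sketch of the idea, not Beam's actual implementation: the helper names `infer_output_dtype` and `apply_partition` are hypothetical, and the sketch assumes the applied function carries a return annotation. The dtype is decided once from the annotation, then forced onto every partition, so no partition has to observe the data to agree on a dtype.

```python
import typing

import numpy as np
import pandas as pd

# Hypothetical mapping from Python return annotations to numpy dtypes.
_ANNOTATION_TO_DTYPE = {
    int: np.dtype("int64"),
    float: np.dtype("float64"),
    bool: np.dtype("bool"),
}


def infer_output_dtype(func):
    """Guess the result dtype from func's return annotation.

    Falls back to object when there is no usable annotation, which
    matches what convert_dtype=False would produce.
    """
    try:
        hints = typing.get_type_hints(func)
    except (TypeError, NameError):
        return np.dtype(object)
    return _ANNOTATION_TO_DTYPE.get(hints.get("return"), np.dtype(object))


def apply_partition(partition, func, dtype):
    """Apply func to one partition and cast to the pre-computed dtype."""
    return partition.apply(func).astype(dtype)


def length(s: str) -> int:
    return len(s)


# Two partitions of the same logical Series; the second one is empty,
# so pandas alone has nothing to observe and would leave it as object.
partitions = [pd.Series(["a", "bb"]), pd.Series([], dtype=object)]
dtype = infer_output_dtype(length)
results = [apply_partition(p, length, dtype) for p in partitions]
```

With this approach both partitions come out with the same dtype even though one of them is empty. Functions without annotations (lambdas, for example) fall back to object in this sketch.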



--
This message was sent by Atlassian Jira
(v8.3.4#803005)