Posted to issues@beam.apache.org by "Brian Hulette (Jira)" <ji...@apache.org> on 2021/01/12 23:35:00 UTC
[jira] [Created] (BEAM-11627) Properly support `convert_dtype=True` in Series.apply
Brian Hulette created BEAM-11627:
------------------------------------
Summary: Properly support `convert_dtype=True` in Series.apply
Key: BEAM-11627
URL: https://issues.apache.org/jira/browse/BEAM-11627
Project: Beam
Issue Type: Improvement
Components: sdk-py-core
Reporter: Brian Hulette
See https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.apply.html
convert_dtype=True indicates that pandas should observe the output and set the dtype to something other than object if possible. We should intercept this argument and use type inference to set the dtype. We can't rely on pandas' inference since our implementation can't observe the entire dataset.
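A minimal sketch of the idea, not Beam's actual implementation: the output dtype is decided once from the function itself, then applied to every partition, so no partition has to guess from the values it happens to see. The helpers `infer_output_dtype` and `apply_partition` are hypothetical names, and reading the return annotation with `typing.get_type_hints` is a simplified stand-in for real type inference.

```python
import typing

import numpy as np
import pandas as pd


def infer_output_dtype(func, default=np.dtype(object)):
    """Hypothetical helper: map func's return annotation to a numpy dtype.

    Falls back to object when there is no usable annotation.
    """
    try:
        return_type = typing.get_type_hints(func).get("return")
    except (TypeError, NameError):
        return default
    if return_type in (int, float, bool, complex):
        return np.dtype(return_type)
    return default


def apply_partition(partition, func, dtype):
    """Hypothetical helper: apply func to one partition with a fixed dtype."""
    # Build the result directly so the dtype comes from the up-front
    # inference, not from whatever values this partition contains.
    values = [func(v) for v in partition]
    return pd.Series(values, index=partition.index, dtype=dtype)


def halve(x: int) -> float:
    return x / 2


# Decide the dtype once, before looking at any data.
dtype = infer_output_dtype(halve)

# Two partitions of the same logical Series; the second one is empty.
partitions = [pd.Series([2, 4]), pd.Series([], dtype="int64")]
results = [apply_partition(p, halve, dtype) for p in partitions]
```

Both results, including the one from the empty partition, carry the same dtype because it was fixed before any values were observed. A function with no return annotation falls back to object in this sketch.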