Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2021/10/22 18:53:00 UTC
[jira] [Created] (ARROW-14448) [Python] Update pyarrow.array() docstring note on timestamp (timezone) conversion
Joris Van den Bossche created ARROW-14448:
---------------------------------------------
Summary: [Python] Update pyarrow.array() docstring note on timestamp (timezone) conversion
Key: ARROW-14448
URL: https://issues.apache.org/jira/browse/ARROW-14448
Project: Apache Arrow
Issue Type: Improvement
Components: Python
Reporter: Joris Van den Bossche
The current {{pa.array(..)}} docstring has the following note:
bq. Localized timestamps will currently be returned as UTC (pandas’s native representation). Timezone-naive data will be implicitly interpreted as UTC.
This is IMO quite confusing. While it is _technically_ somewhat correct (we do store UTC values under the hood for timezone-aware data, and values "as if" in UTC for timezone-naive data), it does not describe what the user actually gets back. The type of the returned array preserves the timezone if possible, or has no timezone at all (not UTC either) when converted from naive timestamps.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)