You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Dewey Dunnington (Jira)" <ji...@apache.org> on 2021/12/03 13:54:00 UTC

[jira] [Commented] (ARROW-13168) [C++] Timezone database configuration and access

    [ https://issues.apache.org/jira/browse/ARROW-13168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17453018#comment-17453018 ] 

Dewey Dunnington commented on ARROW-13168:
------------------------------------------

Just a +1 for a "runtime configuration" option. In R we have the [tzdb package|]. Currently it only provides the text format of the IANA database but we could use that approach if we need something different (maintained sepaerately to keep it up to date). I'm less familiar with Python but I imagine something similar exist there, too.

{code:R}
list.files(tzdb::tzdb_path("text"))
#>  [1] "africa"            "antarctica"        "asia"             
#>  [4] "australasia"       "backward"          "backzone"         
#>  [7] "calendars"         "checklinks.awk"    "checktab.awk"     
#> [10] "CONTRIBUTING"      "etcetera"          "europe"           
#> [13] "factory"           "iso3166.tab"       "leap-seconds.list"
#> [16] "leapseconds"       "leapseconds.awk"   "LICENSE"          
#> [19] "Makefile"          "NEWS"              "northamerica"     
#> [22] "README"            "southamerica"      "theory.html"      
#> [25] "version"           "windowsZones.xml"  "ziguard.awk"      
#> [28] "zishrink.awk"      "zone.tab"          "zone1970.tab"     
#> [31] "zoneinfo2tdf.pl"
{code}


> [C++] Timezone database configuration and access
> ------------------------------------------------
>
>                 Key: ARROW-13168
>                 URL: https://issues.apache.org/jira/browse/ARROW-13168
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>            Reporter: Rok Mihevc
>            Priority: Major
>              Labels: timestamp
>
>  Note: currently timezone database is not available on windows so timezone aware operations will fail.
> We're using tz.h library which needs an updated timezone database to correctly handle timezoned timestamps. See [installation instructions|https://howardhinnant.github.io/date/tz.html#Installation].
> We have the following options for getting a timezone database:
>  # local (non-windows) OS timezone database - no work required.
>  # arrow bundled folder - we could bundle the database at build time for windows. Database would slowly go stale.
>  # download it from IANA Time Zone Database at runtime - tz.h gets the database at runtime, but curl (and 7-zip on windows) are required.
>  # local user-provided folder - user could provide a location at buildtime. Nice to have.
>  # allow runtime configuration - at runtime say: "the tzdata can be found at this location"
> For more context see: [ARROW-12980|https://github.com/apache/arrow/pull/10457] and [PEP 615|https://www.python.org/dev/peps/pep-0615/#sources-for-time-zone-data].



--
This message was sent by Atlassian Jira
(v8.20.1#820001)