Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/05/24 13:35:00 UTC
[jira] [Work logged] (HIVE-25093) date_format() UDF is returning output in UTC time zone only
[ https://issues.apache.org/jira/browse/HIVE-25093?focusedWorklogId=774007&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-774007 ]
ASF GitHub Bot logged work on HIVE-25093:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 24/May/22 13:34
Start Date: 24/May/22 13:34
Worklog Time Spent: 10m
Work Description: zabetak commented on PR #2252:
URL: https://github.com/apache/hive/pull/2252#issuecomment-1135932103
I was really surprised to see that this PR, which is about dates, included changes to the `HiveConnection` class. It appears that those changes were reverted in https://github.com/apache/hive/pull/2404, and thankfully the comments there describe the reasoning. I am adding a comment here in case someone else bumps into the same lines of code in the future.
Issue Time Tracking
-------------------
Worklog Id: (was: 774007)
Time Spent: 50m (was: 40m)
> date_format() UDF is returning output in UTC time zone only
> ------------------------------------------------------------
>
> Key: HIVE-25093
> URL: https://issues.apache.org/jira/browse/HIVE-25093
> Project: Hive
> Issue Type: Bug
> Components: UDF
> Affects Versions: 3.1.2
> Reporter: Ashish Sharma
> Assignee: Ashish Sharma
> Priority: Minor
> Labels: pull-request-available, timestamp
> Fix For: 4.0.0, 4.0.0-alpha-1
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> *HIVE - 1.2*
> sshuser@hn0-dateti:~$ *timedatectl*
> Local time: Thu 2021-05-06 11:56:08 IST
> Universal time: Thu 2021-05-06 06:26:08 UTC
> RTC time: Thu 2021-05-06 06:26:08
> Time zone: Asia/Kolkata (IST, +0530)
> Network time on: yes
> NTP synchronized: yes
> RTC in local TZ: no
> sshuser@hn0-dateti:~$ beeline
> 0: jdbc:hive2://localhost:10001/default> *select date_format(current_timestamp,"yyyy-MM-dd HH:mm:ss.SSS z");*
> +------------------------------+--+
> | _c0 |
> +------------------------------+--+
> | 2021-05-06 11:58:53.760 IST |
> +------------------------------+--+
> 1 row selected (1.271 seconds)
> *HIVE - 3.1.0*
> sshuser@hn0-testja:~$ *timedatectl*
> Local time: Thu 2021-05-06 12:03:32 IST
> Universal time: Thu 2021-05-06 06:33:32 UTC
> RTC time: Thu 2021-05-06 06:33:32
> Time zone: Asia/Kolkata (IST, +0530)
> Network time on: yes
> NTP synchronized: yes
> RTC in local TZ: no
> sshuser@hn0-testja:~$ beeline
> 0: jdbc:hive2://zk0-testja.e0mrrixnyxde5h1suy> *select date_format(current_timestamp,"yyyy-MM-dd HH:mm:ss.SSS z");*
> +------------------------------+
> | _c0 |
> +------------------------------+
> | *2021-05-06 06:33:59.078 UTC* |
> +------------------------------+
> 1 row selected (13.396 seconds)
> 0: jdbc:hive2://zk0-testja.e0mrrixnyxde5h1suy> *set hive.local.time.zone=Asia/Kolkata;*
> No rows affected (0.025 seconds)
> 0: jdbc:hive2://zk0-testja.e0mrrixnyxde5h1suy> *select date_format(current_timestamp,"yyyy-MM-dd HH:mm:ss.SSS z");*
> +------------------------------+
> | _c0 |
> +------------------------------+
> | *2021-05-06 12:08:15.118 UTC* |
> +------------------------------+
> 1 row selected (1.074 seconds)
> expected result was *2021-05-06 12:08:15.118 IST*
> As part of HIVE-12192, it was decided to use a common time zone, UTC, for all computation. As a result, the date_format() function was hard-coded to UTC.
> Later, in HIVE-21039, it was decided that the user session time zone, not UTC, should be the default.
> date_format() was not fixed as part of HIVE-21039.
> What should the ideal time zone for date_format() be?
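The mismatch reported above can be reproduced outside Hive with plain `java.time`. The sketch below is not Hive's implementation; the class name `DateFormatZoneSketch` and its `format` helper are hypothetical and exist only for illustration. It assumes the timestamp is a zone-less wall-clock value and shows that the zone attached at formatting time decides the label printed by `z`, while the wall-clock digits stay the same.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.format.DateTimeFormatter;
import java.util.Locale;

// Hypothetical illustration only; not code from Hive.
class DateFormatZoneSketch {
    // Same pattern as the query in the report.
    static final DateTimeFormatter FMT =
        DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss.SSS z", Locale.ENGLISH);

    // Attach a zone to a zone-less wall-clock timestamp, then format it.
    // The wall-clock fields are kept as-is; only the zone label changes.
    static String format(LocalDateTime wallClock, ZoneId zone) {
        return wallClock.atZone(zone).format(FMT);
    }

    public static void main(String[] args) {
        // Wall-clock value taken from the report (session zone Asia/Kolkata).
        LocalDateTime ts = LocalDateTime.of(2021, 5, 6, 12, 8, 15, 118_000_000);

        // Formatter zone fixed to UTC: local digits with a UTC label,
        // which is the shape of the result the report flags as wrong.
        System.out.println(format(ts, ZoneId.of("UTC")));

        // Formatter zone taken from the session: same digits, session-zone label.
        // The exact short name printed depends on the JDK's locale data.
        System.out.println(format(ts, ZoneId.of("Asia/Kolkata")));
    }
}
```

Under this model, the fix amounts to passing the session time zone, rather than a constant UTC zone, to the formatter.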