You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Eric Blanco (JIRA)" <ji...@apache.org> on 2018/06/13 11:49:00 UTC
[jira] [Commented] (SPARK-24545) Function hour not working as
expected for hour 2 in PySpark
[ https://issues.apache.org/jira/browse/SPARK-24545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16510999#comment-16510999 ]
Eric Blanco commented on SPARK-24545:
-------------------------------------
Ok apparenlty this is due to a change of hour that was that day form 2 AM to 3 AM in Europe :)
> Function hour not working as expected for hour 2 in PySpark
> -----------------------------------------------------------
>
> Key: SPARK-24545
> URL: https://issues.apache.org/jira/browse/SPARK-24545
> Project: Spark
> Issue Type: Bug
> Components: Java API
> Affects Versions: 2.2.1
> Reporter: Eric Blanco
> Priority: Minor
>
> Hello,
> I tried to get the hour out of a date and it works except if the hour is 2. It works well in Scala but in PySpark it shows hour 3 instead of hour 2.
> Example:
> {code:java}
> from pyspark.sql.functions import *
> columns = ["id","date"]
> vals = [(4,"2016-03-27 02:00:00")]
> df = sqlContext.createDataFrame(vals, columns)
> df.withColumn("hours", hour(col("date"))).show(){code}
> |id|date|hours|
> |4|2016-03-27 2:00:00|3|
> It works as expected for other hours.
> Also, if you change the year or month apparently it works well.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org