You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Michael Bingham (Jira)" <ji...@apache.org> on 2020/11/05 08:30:00 UTC

[jira] [Commented] (KAFKA-10685) --to-datetime passed to kafka-consumer-groups interpreting microseconds wrong

    [ https://issues.apache.org/jira/browse/KAFKA-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17226576#comment-17226576 ] 

Michael Bingham commented on KAFKA-10685:
-----------------------------------------

This issue seems to occur with Java 11, but not with Java 8. Code currently uses {{SimpleDateFormat}}. {{DateTimeFormatter}} behaved consistently across Java 8 and 11 (throwing a parse exception if milliseconds is more than 3 digits) - maybe worth considering changing to:
{code:java}
DateTimeFormatter.ofPattern("yyyy-MM-dd'T'HH:mm:ss.SSSXXX")
{code}
 

 

> --to-datetime passed to kafka-consumer-groups interpreting microseconds wrong
> -----------------------------------------------------------------------------
>
>                 Key: KAFKA-10685
>                 URL: https://issues.apache.org/jira/browse/KAFKA-10685
>             Project: Kafka
>          Issue Type: Bug
>    Affects Versions: 2.7.0
>            Reporter: Russell Sayers
>            Assignee: Luke Chen
>            Priority: Minor
>
> If you pass more than 3 decimal places for the fractional seconds of the datetime, the microseconds get interpreted as milliseconds.
> {{kafka-consumer-groups --bootstrap-server kafka:9092 }}
> {{--reset-offsets }}
> {{--group webserver-avro }}
> {{--topic driver-positions-avro }}
> {{ {{--to-datetime "}}{{2020-11-05T00:46:48.002237400}}" }}
> {{ {{--dry-run}}}}
> Relevant code [here|https://github.com/apache/kafka/blob/2.7/clients/src/main/java/org/apache/kafka/common/utils/Utils.java#L1304]. The datetime is being turned into Nov 5, 2020 1:24:05.400 because SimpleDateFormat is adding 2237400 milliseconds to Nov 5, 2020 00:46:48.
> Experimenting with getDateTime:
>  * getDateTime("2020-11-05T00:46:48.000") -> 1604537208000
>  * getDateTime("2020-11-05T00:46:48.000+0800") -> 1604508408000 - correct the formatting string allows for ZZZ timezones
>  * getDateTime("2020-11-05T00:46:48.000123") -> 1604537208123 - note this ends with 123 milliseconds.
> The pattern string is "yyyy-MM-dd'T'HH:mm:ss.SSS".  So SimpleDateFormat interprets "000123" as 123 milliseconds. See the stackoverflow answer [here|https://stackoverflow.com/a/21235602/109102].
> The fix?  Remove any digits after more than 3 characters after the decimal point, or raise an exception. The code would still need to allow the RFC822 timezone, i.e Sign TwoDigitHours Minutes.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)