You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Martin Nowak (Jira)" <ji...@apache.org> on 2021/03/20 09:59:00 UTC

[jira] [Created] (ARROW-12031) [C++] infer CSV timestamps columns with fractional seconds

Martin Nowak created ARROW-12031:
------------------------------------

             Summary: [C++] infer CSV timestamps columns with fractional seconds
                 Key: ARROW-12031
                 URL: https://issues.apache.org/jira/browse/ARROW-12031
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
    Affects Versions: 3.0.0
            Reporter: Martin Nowak


After implementing fractional second parsing in and ARROW-9107 and ARROW-10337, no support for inference of subsecond timestamps was added (see [code]([https://github.com/apache/arrow/blob/master/cpp/src/arrow/csv/inference_internal.h#L115), also mentioned in ARROW-3738).

The [InferState logic | https://github.com/apache/arrow/blob/81d6724546559fa8d6e85376a1cc339dab0506ad/cpp/src/arrow/csv/inference_internal.h#L63] could test for TimestampNS (with nanosecond resolution) after Timestamps with second resolution to not change existing inference results.

Wouldn't expect too much interest in trying to infer timestamps with milli- and microsecond resolution. The nanosecond resolution [can already parse |https://github.com/apache/arrow/blob/81d6724546559fa8d6e85376a1cc339dab0506ad/cpp/src/arrow/util/value_parsing.h#L512-L518] the lower resolution ones.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)