You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Quang Hoang (Jira)" <ji...@apache.org> on 2022/10/30 12:42:00 UTC
[jira] [Assigned] (ARROW-17506) [Python][C++] pyarrow parquet writer - missing time logical type
[ https://issues.apache.org/jira/browse/ARROW-17506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Quang Hoang reassigned ARROW-17506:
-----------------------------------
Assignee: Quang Hoang
> [Python][C++] pyarrow parquet writer - missing time logical type
> ----------------------------------------------------------------
>
> Key: ARROW-17506
> URL: https://issues.apache.org/jira/browse/ARROW-17506
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, Parquet, Python
> Affects Versions: 8.0.0, 9.0.0
> Reporter: Vadim Goy
> Assignee: Quang Hoang
> Priority: Major
>
>
> pyarrow.parquet.write_table used for write Parquet file
> In parquet schema missed logical type for TIME, it’s just long type
> PyArrow Schema
> {code:java}
> NUMBER: int64
> DECIMAL: int64
> NUMERIC: int64
> INT: int64
> FLOAT: double
> VARCHAR: string
> TEXT: string
> CHAR: string
> BOOLEAN: bool
> ARR: string
> VAR: string
> OBJ: string
> TIMESTAMP: timestamp[ns]
> DATE: date64[ms]
> TIME: time64[ns]
> PK: int64
> UUID: binary
> UUID2: string
> UUID3: string {code}
> Parquet schema
> {code:java}
> {
> "type" : "record",
> "name" : "schema",
> "fields" : [ {
> "name" : "NUMBER",
> "type" : [ "null", "long" ],
> "default" : null
> }, {
> "name" : "DECIMAL",
> "type" : [ "null", "long" ],
> "default" : null
> }, {
> "name" : "NUMERIC",
> "type" : [ "null", "long" ],
> "default" : null
> }, {
> "name" : "INT",
> "type" : [ "null", "long" ],
> "default" : null
> }, {
> "name" : "FLOAT",
> "type" : [ "null", "double" ],
> "default" : null
> }, {
> "name" : "VARCHAR",
> "type" : [ "null", "string" ],
> "default" : null
> }, {
> "name" : "TEXT",
> "type" : [ "null", "string" ],
> "default" : null
> }, {
> "name" : "CHAR",
> "type" : [ "null", "string" ],
> "default" : null
> }, {
> "name" : "BOOLEAN",
> "type" : [ "null", "boolean" ],
> "default" : null
> }, {
> "name" : "ARR",
> "type" : [ "null", "string" ],
> "default" : null
> }, {
> "name" : "VAR",
> "type" : [ "null", "string" ],
> "default" : null
> }, {
> "name" : "OBJ",
> "type" : [ "null", "string" ],
> "default" : null
> }, {
> "name" : "TIMESTAMP",
> "type" : [ "null", {
> "type" : "long",
> "logicalType" : "timestamp-micros"
> } ],
> "default" : null
> }, {
> "name" : "DATE",
> "type" : [ "null", {
> "type" : "int",
> "logicalType" : "date"
> } ],
> "default" : null
> }, {
> "name" : "TIME",
> "type" : [ "null", "long" ],
> "default" : null
> }, {
> "name" : "PK",
> "type" : [ "null", "long" ],
> "default" : null
> }, {
> "name" : "UUID",
> "type" : [ "null", "bytes" ],
> "default" : null
> }, {
> "name" : "UUID2",
> "type" : [ "null", "string" ],
> "default" : null
> }, {
> "name" : "UUID3",
> "type" : [ "null", "string" ],
> "default" : null
> } ]
> }{code}
>
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)