You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by "Fokko (via GitHub)" <gi...@apache.org> on 2023/02/27 08:39:39 UTC

[GitHub] [iceberg] Fokko opened a new issue, #6945: Python: Inconsistency around timezones

Fokko opened a new issue, #6945:
URL: https://github.com/apache/iceberg/issues/6945

   ### Apache Iceberg version
   
   main (development)
   
   ### Query engine
   
   Other
   
   ### Please describe the bug 🐞
   
   With PyIceberg when we filter a complete DataFile, we end up with:
   ```
   ArrowInvalid: Schema at index 1 was different: 
   vendor_id: int32
   pickup_time: timestamp[us, tz=+00:00]
   pickup_location_id: int32
   dropoff_time: timestamp[us, tz=+00:00]
   dropoff_location_id: int32
   passenger_count: int32
   trip_distance: double
   ratecode_id: int32
   payment_type: int32
   total_amount: double
   fare_amount: double
   tip_amount: double
   tolls_amount: double
   mta_tax: double
   improvement_surcharge: double
   congestion_surcharge: double
   extra_surcharges: double
   store_and_forward_flag: string
   vs
   vendor_id: int32
   pickup_time: timestamp[us, tz=UTC]
   pickup_location_id: int32
   dropoff_time: timestamp[us, tz=UTC]
   dropoff_location_id: int32
   passenger_count: int32
   trip_distance: double
   ratecode_id: int32
   payment_type: int32
   total_amount: double
   fare_amount: double
   tip_amount: double
   tolls_amount: double
   mta_tax: double
   improvement_surcharge: double
   congestion_surcharge: double
   extra_surcharges: double
   store_and_forward_flag: string
   ```
   
   We get a `+00:00` from the empty tables that we're `concat`'ing, and a `UTC` from the ones that actually contain data:
   ![image](https://user-images.githubusercontent.com/1134248/221514436-9ad1e256-9567-4e4a-8524-8acb8ed62b77.png)
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] Fokko commented on issue #6945: Python: Inconsistency around timezones

Posted by "Fokko (via GitHub)" <gi...@apache.org>.
Fokko commented on issue #6945:
URL: https://github.com/apache/iceberg/issues/6945#issuecomment-1445924812

   Maybe we should also change the default timezone to UTC, since I think it is more common to have `UTC` as default, instead of `+00:00`:
   ![image](https://user-images.githubusercontent.com/1134248/221516071-a7562861-0e69-4c0e-8cc3-457711ddcea9.png)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] Fokko closed issue #6945: Python: Inconsistency around timezones

Posted by "Fokko (via GitHub)" <gi...@apache.org>.
Fokko closed issue #6945: Python: Inconsistency around timezones
URL: https://github.com/apache/iceberg/issues/6945


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org