You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by "Fokko (via GitHub)" <gi...@apache.org> on 2023/04/11 17:26:45 UTC
[GitHub] [iceberg] Fokko commented on pull request #6879: Python: Liberal Timestamp parsing
Fokko commented on PR #6879:
URL: https://github.com/apache/iceberg/pull/6879#issuecomment-1503811480
@rdblue I've added it to the config. This way you can also do:
```python
cat = load_catalog('prod', 'timezone'='+01:00')
```
Now we also get warnings:
```
In [1]: from pyiceberg.catalog import load_catalog
...: from pyiceberg.expressions import GreaterThanOrEqual, LessThan, And
...:
...: cat = load_catalog('prod')
...:
...: tbl = cat.load_table(('examples', 'nyc_taxi_yellow'))
...:
...:
...: tbl.scan(row_filter=And(GreaterThanOrEqual("pickup_time", "2022-01-01T00:00:00"), GreaterThanOrEqual("dropoff_time", "2022-01-01T00:00:00"))).to_arrow()
...:
/Users/fokkodriesprong/Desktop/iceberg/python/pyiceberg/expressions/visitors.py:1451: UserWarning: Assuming timezone +00:00 for: GreaterThanOrEqual(term=Reference(name='pickup_time'), literal=literal('2022-01-01T00:00:00'))
warnings.warn(f"Assuming timezone {self.default_timezone} for: {predicate}")
/Users/fokkodriesprong/Desktop/iceberg/python/pyiceberg/expressions/visitors.py:1451: UserWarning: Assuming timezone +00:00 for: GreaterThanOrEqual(term=Reference(name='dropoff_time'), literal=literal('2022-01-01T00:00:00'))
warnings.warn(f"Assuming timezone {self.default_timezone} for: {predicate}")
Out[1]:
pyarrow.Table
vendor_id: int32
pickup_time: timestamp[us, tz=UTC]
pickup_location_id: int32
dropoff_time: timestamp[us, tz=UTC]
dropoff_location_id: int32
passenger_count: int32
trip_distance: double
ratecode_id: int32
payment_type: int32
total_amount: double
fare_amount: double
tip_amount: double
tolls_amount: double
mta_tax: double
improvement_surcharge: double
congestion_surcharge: double
extra_surcharges: double
store_and_forward_flag: string
----
vendor_id: [[2],[2],...,[2,2,2,2],[2,2]]
pickup_time: [[2029-05-05 15:37:39.000000],[2026-12-17 07:38:40.000000],...,[2038-02-18 04:41:09.000000,2038-02-18 04:36:24.000000,2038-02-18 05:46:29.000000,2038-02-18 05:55:54.000000],[2088-01-24 08:15:42.000000,2088-01-24 08:25:39.000000]]
pickup_location_id: [[231],[132],...,[262,263,249,113],[41,24]]
dropoff_time: [[2029-05-05 19:18:20.000000],[2026-12-17 08:09:21.000000],...,[2038-02-18 08:03:11.000000,2038-02-18 04:39:22.000000,2038-02-18 05:53:20.000000,2038-02-19 05:13:21.000000],[2088-01-24 08:19:46.000000,2088-01-24 15:28:25.000000]]
dropoff_location_id: [[249],[79],...,[140,262,114,239],[166,162]]
passenger_count: [[1],[1],...,[1,1,1,1],[1,1]]
trip_distance: [[1.69],[18.98],...,[0.56,0.45,0.86,5.07],[0.63,4.05]]
ratecode_id: [[1],[2],...,[1,1,1,1],[1,1]]
payment_type: [[1],[2],...,[1,2,2,2],[2,2]]
total_amount: [[12.96],[55.3],...,[9.36,7.8,9.3,21.8],[5.3,15.3]]
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org