You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "jorisvandenbossche (via GitHub)" <gi...@apache.org> on 2023/03/23 08:24:47 UTC
[GitHub] [arrow] jorisvandenbossche opened a new issue, #34701: [Python] Expose Run-End Encoded arrays in PyArrow
jorisvandenbossche opened a new issue, #34701:
URL: https://github.com/apache/arrow/issues/34701
The initial bindings for Run-End Encoded arrays (the array and type classes) landed in https://github.com/apache/arrow/pull/34570. Some follow-ups:
- [ ] https://github.com/apache/arrow/issues/34686
- [ ] Support construction in `pa.array(..)`
- [ ] Conversion pyarrow<->numpy/pandas
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] felipecrv commented on issue #34701: [Python] Expose Run-End Encoded arrays in PyArrow
Posted by "felipecrv (via GitHub)" <gi...@apache.org>.
felipecrv commented on issue #34701:
URL: https://github.com/apache/arrow/issues/34701#issuecomment-1579030034
`RunEndEncoded` is one of the types listed here as "NotImplemented". No specific reason, just a matter of taking the time to write and test the implementation.
https://github.com/apache/arrow/blob/main/python/pyarrow/src/arrow/python/arrow_to_pandas.cc#L1348
The list of things to implement for a new format is huge and this is one of the things that I didn't have time to complete yet.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
Re: [I] [Python] Expose Run-End Encoded arrays in PyArrow [arrow]
Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF closed issue #34701: [Python] Expose Run-End Encoded arrays in PyArrow
URL: https://github.com/apache/arrow/issues/34701
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
Re: [I] [Python] Expose Run-End Encoded arrays in PyArrow [arrow]
Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF commented on issue #34701:
URL: https://github.com/apache/arrow/issues/34701#issuecomment-2019568037
Closing this issue as all subtasks are completed.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] jorisvandenbossche commented on issue #34701: [Python] Expose Run-End Encoded arrays in PyArrow
Posted by "jorisvandenbossche (via GitHub)" <gi...@apache.org>.
jorisvandenbossche commented on issue #34701:
URL: https://github.com/apache/arrow/issues/34701#issuecomment-1578938916
Example of the current behaviour for conversion to numpy/pandas:
```
In [36]: arr = pc.run_end_encode([1, 2, 2, 3, 3, 3])
In [37]: arr.to_numpy()
---------------------------------------------------------------------------
ArrowNotImplementedError Traceback (most recent call last)
Cell In [37], line 1
----> 1 arr.to_numpy()
File pyarrow/array.pxi:1530, in pyarrow.lib.Array.to_numpy()
File pyarrow/error.pxi:121, in pyarrow.lib.check_status()
ArrowNotImplementedError: No known equivalent Pandas block for Arrow data of type run_end_encoded<run_ends: int32, values: int64> is known.
In [38]: arr.to_pandas()
---------------------------------------------------------------------------
ArrowNotImplementedError Traceback (most recent call last)
Cell In [38], line 1
----> 1 arr.to_pandas()
File pyarrow/array.pxi:855, in pyarrow.lib._PandasConvertible.to_pandas()
File pyarrow/array.pxi:1478, in pyarrow.lib.Array._to_pandas()
File pyarrow/array.pxi:1682, in pyarrow.lib._array_like_to_pandas()
File pyarrow/error.pxi:121, in pyarrow.lib.check_status()
ArrowNotImplementedError: No known equivalent Pandas block for Arrow data of type run_end_encoded<run_ends: int32, values: int64> is known.
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org