Posted to issues@arrow.apache.org by "alippai (via GitHub)" <gi...@apache.org> on 2023/03/07 20:54:24 UTC
[GitHub] [arrow] alippai opened a new issue, #34491: [Python] Add pyarrow.TableGroupBy() subtables
alippai opened a new issue, #34491:
URL: https://github.com/apache/arrow/issues/34491
### Describe the enhancement requested
```python
import pyarrow as pa
t = pa.table([
    pa.array(["a", "a", "b", "b", "c"]),
    pa.array([1, 2, 3, 4, 5]),
], names=["keys", "values"])

for subtable in pa.TableGroupBy(t, "keys"):  # loop over the 3 tables
    subtable.to_pandas()  # with the keys column

# alternatively
for key_name, subtable in pa.TableGroupBy(t, "keys"):  # loop over the 3 tables
    subtable.to_pandas()  # without the keys column
```
### Component(s)
Python
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] jorisvandenbossche commented on issue #34491: [Python] Add pyarrow.TableGroupBy() subtables
Posted by "jorisvandenbossche (via GitHub)" <gi...@apache.org>.
jorisvandenbossche commented on issue #34491:
URL: https://github.com/apache/arrow/issues/34491#issuecomment-1473513463
Some related discussion at https://ursalabs.zulipchat.com/#narrow/stream/180245-dev/topic/Acero.20-.20GroupingSegmenter/near/342103440