You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2022/01/18 20:46:00 UTC
[jira] [Comment Edited] (ARROW-14798) [Python] Limit the size of the repr for large Tables
[ https://issues.apache.org/jira/browse/ARROW-14798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17478194#comment-17478194 ]
Joris Van den Bossche edited comment on ARROW-14798 at 1/18/22, 8:45 PM:
-------------------------------------------------------------------------
Can we keep this on 7.0.0? (or keep ARROW-15329 on 7.0.0?)
The table repr is basically unusable right now unless you are using small test data
was (Author: jorisvandenbossche):
Can we keep this on 7.0.0?
The table repr is basically unusable right now unless you are using small test data
> [Python] Limit the size of the repr for large Tables
> ----------------------------------------------------
>
> Key: ARROW-14798
> URL: https://issues.apache.org/jira/browse/ARROW-14798
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++, Python
> Reporter: Joris Van den Bossche
> Assignee: Will Jones
> Priority: Major
> Labels: good-first-issue, pull-request-available
> Fix For: 7.0.0
>
> Time Spent: 2h 20m
> Remaining Estimate: 0h
>
> The new repr is nice that it shows a preview of the data, but this can also become very long flooding your console output for larger tables.
> We already default to 10 preview cols, but each column can still consist of many chunks. So it might be good to also limit it to 2 chunks?
> The ChunkedArray.to_string method already has a {{window}} keyword, but that seems to control both the number of elements to show per chunk as the number of chunks (while it would be nice to limit eg to 2 chunks but show up to 10 elements for each chunk).
> cc [~amol-]
--
This message was sent by Atlassian Jira
(v8.20.1#820001)