You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Krisztian Szucs (Jira)" <ji...@apache.org> on 2020/09/14 21:35:00 UTC
[jira] [Created] (ARROW-10006) [C++][Python] Do not collect python
iterators if not necessary
Krisztian Szucs created ARROW-10006:
---------------------------------------
Summary: [C++][Python] Do not collect python iterators if not necessary
Key: ARROW-10006
URL: https://issues.apache.org/jira/browse/ARROW-10006
Project: Apache Arrow
Issue Type: New Feature
Components: Python
Reporter: Krisztian Szucs
When converting python objects to arrow array currently we always collect the input to a sequence, but this may be memory consuming in certain cases.
For unknown sized iterators we could consume and temporarily store the seen items during inference potentially improving both the conversion time and peak memory usage.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)