You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Francois Saint-Jacques (Jira)" <ji...@apache.org> on 2020/04/02 17:46:00 UTC
[jira] [Created] (ARROW-8318) [C++][Dataset] Dataset should
instantiate Fragment
Francois Saint-Jacques created ARROW-8318:
---------------------------------------------
Summary: [C++][Dataset] Dataset should instantiate Fragment
Key: ARROW-8318
URL: https://issues.apache.org/jira/browse/ARROW-8318
Project: Apache Arrow
Issue Type: Improvement
Components: C++ - Dataset
Reporter: Francois Saint-Jacques
Fragments are created on the fly when invoking a Scan. This means that a lot of the auxilliary/ancilliary data must be stored by the specialised Dataset, e.g. the FileSystemDataset must hold the path and partition expression. With the venue of more complex Fragment, e.g. ParquetFileFragment, more data must be stored.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)