You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Francois Saint-Jacques (Jira)" <ji...@apache.org> on 2020/04/02 17:46:00 UTC

[jira] [Created] (ARROW-8318) [C++][Dataset] Dataset should instantiate Fragment

Francois Saint-Jacques created ARROW-8318:
---------------------------------------------

             Summary: [C++][Dataset] Dataset should instantiate Fragment
                 Key: ARROW-8318
                 URL: https://issues.apache.org/jira/browse/ARROW-8318
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++ - Dataset
            Reporter: Francois Saint-Jacques


Fragments are created on the fly when invoking a Scan. This means that a lot of the auxilliary/ancilliary data must be stored by the specialised Dataset, e.g. the FileSystemDataset must hold the path and partition expression. With the venue of more complex Fragment, e.g. ParquetFileFragment, more data must be stored. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)