You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2020/04/10 15:17:00 UTC

[jira] [Updated] (ARROW-8318) [C++][Dataset] Dataset should instantiate Fragment

     [ https://issues.apache.org/jira/browse/ARROW-8318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated ARROW-8318:
--------------------------------
    Component/s:     (was: C++ - Dataset)
                 C++

> [C++][Dataset] Dataset should instantiate Fragment
> --------------------------------------------------
>
>                 Key: ARROW-8318
>                 URL: https://issues.apache.org/jira/browse/ARROW-8318
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Francois Saint-Jacques
>            Priority: Major
>              Labels: dataset
>
> Fragments are created on the fly when invoking a Scan. This means that a lot of the auxilliary/ancilliary data must be stored by the specialised Dataset, e.g. the FileSystemDataset must hold the path and partition expression. With the venue of more complex Fragment, e.g. ParquetFileFragment, more data must be stored. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)