Posted to dev@arrow.apache.org by "Ben Kietzman (Jira)" <ji...@apache.org> on 2020/03/17 14:24:00 UTC

[jira] [Created] (ARROW-8137) [C++][Dataset] Investigate multithreaded discovery

Ben Kietzman created ARROW-8137:
-----------------------------------

             Summary: [C++][Dataset] Investigate multithreaded discovery
                 Key: ARROW-8137
                 URL: https://issues.apache.org/jira/browse/ARROW-8137
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++, C++ - Dataset
    Affects Versions: 0.16.0
            Reporter: Ben Kietzman
             Fix For: 1.0.0


Currently, FileSystemDatasetFactory inspects all files serially. For slow file systems, or for systems which support batched reads, discovery could be accelerated by inspecting files in parallel.
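
As a rough illustration of the idea, the sketch below contrasts serial and parallel inspection using only the C++ standard library. It is not Arrow code: InspectionResult, InspectFile, InspectSerial, and InspectParallel are hypothetical names invented for this example, and the "inspection" is a placeholder that does no I/O. It also launches one task per file for simplicity; a real implementation would more likely bound concurrency with a thread pool.

```cpp
#include <cassert>
#include <cstddef>
#include <future>
#include <string>
#include <vector>

// Hypothetical stand-in for whatever inspecting one file yields
// (for example, its schema). Not an Arrow type.
struct InspectionResult {
  std::string path;
  std::size_t num_fields;
};

// Hypothetical per-file inspection. A real version would open the file and
// read its metadata; here the path length serves as a placeholder value.
InspectionResult InspectFile(const std::string& path) {
  return InspectionResult{path, path.size()};
}

// Serial baseline: inspect the files one after another.
std::vector<InspectionResult> InspectSerial(
    const std::vector<std::string>& paths) {
  std::vector<InspectionResult> results;
  results.reserve(paths.size());
  for (const auto& path : paths) {
    results.push_back(InspectFile(path));
  }
  return results;
}

// Parallel variant: launch one asynchronous task per file, then collect the
// results in input order so the output matches the serial baseline.
std::vector<InspectionResult> InspectParallel(
    const std::vector<std::string>& paths) {
  std::vector<std::future<InspectionResult>> futures;
  futures.reserve(paths.size());
  for (const auto& path : paths) {
    // std::launch::async forces each inspection onto its own thread.
    futures.push_back(std::async(std::launch::async, InspectFile, path));
  }

  std::vector<InspectionResult> results;
  results.reserve(futures.size());
  for (auto& future : futures) {
    results.push_back(future.get());  // blocks until that file is inspected
  }
  assert(results.size() == paths.size());
  return results;
}
```

Because the futures are drained in the order they were created, callers see the same ordering as with the serial loop, while the per-file latency of a slow file system can overlap across tasks.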



--
This message was sent by Atlassian Jira
(v8.3.4#803005)