Posted to github@arrow.apache.org by "tustvold (via GitHub)" <gi...@apache.org> on 2023/05/01 16:45:30 UTC

[GitHub] [arrow-datafusion] tustvold opened a new issue, #6182: Faster ListingTable Listing

tustvold opened a new issue, #6182:
URL: https://github.com/apache/arrow-datafusion/issues/6182

   ### Is your feature request related to a problem or challenge?
   
   ListingTable currently uses a very naive algorithm to find the files in a dataset: it serially lists every file in the dataset, and only then applies pruning to each returned file.
   
   ### Describe the solution you'd like
   
   The partition pruning logic should instead list each partition separately, ideally in parallel, prune these partitions, and then return the files contained in the partitions that remain.
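
A minimal sketch of the proposed flow, to make the difference from a full serial listing concrete. It is an illustration under stated assumptions, not DataFusion's implementation: `MockStore`, `list_dirs`, `list_files`, `partition_value` and `list_pruned` are hypothetical names, the store is an in-memory list of keys, only a single hive-style partition level (`column=value/`) is handled, and scoped threads stand in for whatever concurrency the real code would use.

```rust
use std::collections::BTreeSet;
use std::thread;

/// Hypothetical stand-in for an object store: a flat list of object keys.
pub struct MockStore {
    keys: Vec<String>,
}

impl MockStore {
    pub fn new(keys: &[&str]) -> Self {
        Self { keys: keys.iter().map(|k| k.to_string()).collect() }
    }

    /// Delimited listing: only the immediate child "directories" of `prefix`.
    pub fn list_dirs(&self, prefix: &str) -> Vec<String> {
        let mut dirs = BTreeSet::new();
        for key in &self.keys {
            if let Some(rest) = key.strip_prefix(prefix) {
                // Keys without a further '/' are plain files and are skipped.
                if let Some((dir, _)) = rest.split_once('/') {
                    dirs.insert(format!("{prefix}{dir}/"));
                }
            }
        }
        dirs.into_iter().collect()
    }

    /// Recursive listing of every object under `prefix`.
    pub fn list_files(&self, prefix: &str) -> Vec<String> {
        self.keys.iter().filter(|k| k.starts_with(prefix)).cloned().collect()
    }
}

/// Extract the value from a hive-style `column=value/` directory name.
fn partition_value<'a>(dir: &'a str, column: &str) -> Option<&'a str> {
    let name = dir.trim_end_matches('/').rsplit('/').next()?;
    let (col, value) = name.split_once('=')?;
    (col == column).then_some(value)
}

/// List partitions, prune them with `keep`, then list the survivors in parallel.
pub fn list_pruned<F>(store: &MockStore, root: &str, column: &str, keep: F) -> Vec<String>
where
    F: Fn(&str) -> bool,
{
    // Step 1: list partition directories only, and prune before touching files.
    let partitions: Vec<String> = store
        .list_dirs(root)
        .into_iter()
        .filter(|dir| partition_value(dir, column).map_or(false, |v| keep(v)))
        .collect();

    // Step 2: list each surviving partition on its own thread.
    let mut files = thread::scope(|s| {
        // Spawn every listing first so they run concurrently, then join.
        let handles: Vec<_> = partitions
            .iter()
            .map(|p| s.spawn(move || store.list_files(p)))
            .collect();
        handles
            .into_iter()
            .flat_map(|h| h.join().expect("listing thread panicked"))
            .collect::<Vec<String>>()
    });
    files.sort();
    files
}
```

With a predicate such as `|v| v >= "2022"` on a `year` column, files under `year=2021/` are never listed at all, whereas the current approach would list them and discard them afterwards.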
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] tustvold closed issue #6182: Faster ListingTable Listing

Posted by "tustvold (via GitHub)" <gi...@apache.org>.
tustvold closed issue #6182: Faster ListingTable Listing
URL: https://github.com/apache/arrow-datafusion/issues/6182