Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/03/05 12:56:13 UTC

[GitHub] [arrow-datafusion] dispanser commented on issue #1384: Support `register_parquet` from a list of files

dispanser commented on issue #1384:
URL: https://github.com/apache/arrow-datafusion/issues/1384#issuecomment-1059759211


   I have a project with similar requirements: the files / partitions to read are determined by looking them up in a catalog with additional indexing capabilities, before they are passed to DataFusion.
   
   In our case, we don't actually want to register a table (as the selection of files changes with every query), but need something like `read_parquet(Vec<Path>)`.
   I'm not sure whether that makes it sufficiently different from what's discussed here to warrant a separate issue, but it sounds like it requires very similar functionality under the hood.
   
   I'm happy to help, but I'm not well-versed enough in the DataFusion codebase to drive the design decisions.
   

