You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/11/07 17:21:55 UTC

[GitHub] [iceberg] dhruv-pratap commented on pull request #6069: Python: TableScan Plan files API implementation without residual evaluation

dhruv-pratap commented on PR #6069:
URL: https://github.com/apache/iceberg/pull/6069#issuecomment-1305942424

   In retrospect, I think this is becoming too large of a PR and would benefit from breaking down into smaller tasks. I'm going to go ahead and close this PR and if you guys are onboard I can create issues for the below:
   
   1. Add `ScanTask` interface
   2. Add `FileScanTask` class that extends `ScanTask` interface
   3. Fix ManifestEvalVisitor eq #6117 
   4. Add MetricsEvalVisitor that evaluates an expression on a DataFile to test whether rows in the file may match.
   5. Add ResidualEvalVisitor that evaluates the residuals for an expression the partitions in the given PartitionSpec.
   6. Add `TableScan` interface. @rdblue just picked this up in #6131 
   7. Add `DataTableScan` class that extends implements `plan_files()` API
   8. Expose plan_files API as scan subcommand via CLI.
   
   CC: @Fokko @rdblue @samredai 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org