Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/07/31 03:44:52 UTC

[GitHub] [iceberg] amogh-jahagirdar commented on pull request #4401: Spark: add procedure to generate symlink manifests

amogh-jahagirdar commented on PR #4401:
URL: https://github.com/apache/iceberg/pull/4401#issuecomment-1200341801

   
   > @jackye1995 Is there a way to do the same thing using Java API? How can we read the location for all of data files given tableId and snapshotId?
   
   @prashantgohel1 You may have already figured this out, but if you want the data files for a table at a given snapshot, you can go through the Table API: call `table.snapshot` to get the snapshot, then use the snapshot's data manifests API to get all of the data file manifests: https://github.com/apache/iceberg/blob/master/api/src/main/java/org/apache/iceberg/Snapshot.java#L101 After that, it's a matter of using the Java library to read each manifest file: https://github.com/apache/iceberg/blob/master/core/src/main/java/org/apache/iceberg/ManifestFiles.java#L71
   
   Another way is to query the data files metadata table.
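   
   For the manifest-based route, a rough sketch is below. It assumes an already-loaded `Table` and a recent Iceberg release where `Snapshot.dataManifests` takes a `FileIO` argument (older releases exposed a no-argument variant). The class and method names `DataFileLister` and `dataFileLocations` are made up for illustration, and newer releases may prefer `DataFile.location()` over `path()`, so treat this as a starting point rather than the definitive way to do it.
   
   ```java
   import java.io.IOException;
   import java.util.ArrayList;
   import java.util.List;
   
   import org.apache.iceberg.DataFile;
   import org.apache.iceberg.ManifestFile;
   import org.apache.iceberg.ManifestFiles;
   import org.apache.iceberg.ManifestReader;
   import org.apache.iceberg.Snapshot;
   import org.apache.iceberg.Table;
   
   // Hypothetical helper class; not part of Iceberg.
   public class DataFileLister {
   
     // Returns the locations of the live data files tracked by the given snapshot.
     public static List<String> dataFileLocations(Table table, long snapshotId) throws IOException {
       Snapshot snapshot = table.snapshot(snapshotId);
       if (snapshot == null) {
         throw new IllegalArgumentException("Unknown snapshot id: " + snapshotId);
       }
   
       List<String> locations = new ArrayList<>();
       // Assumption: this Iceberg version has dataManifests(FileIO).
       for (ManifestFile manifest : snapshot.dataManifests(table.io())) {
         // The reader holds an open file, so close it once the manifest is consumed.
         try (ManifestReader<DataFile> reader = ManifestFiles.read(manifest, table.io())) {
           for (DataFile file : reader) {
             // Convert to String right away instead of holding on to the DataFile.
             locations.add(file.path().toString());
           }
         }
       }
       return locations;
     }
   }
   ```
   
   Given a `Table` loaded from whatever catalog you use, `DataFileLister.dataFileLocations(table, snapshotId)` would then return one location string per data file in that snapshot.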
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
For additional commands, e-mail: issues-help@iceberg.apache.org