Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2020/12/15 02:55:05 UTC

[GitHub] [iceberg] txdong-sz opened a new issue #1934: remove a list .metadata.json in HadoopCatalog

txdong-sz opened a new issue #1934:
URL: https://github.com/apache/iceberg/issues/1934


   ![image](https://user-images.githubusercontent.com/5066512/102161744-d4d15c00-3ec2-11eb-9be1-bf04edce51e7.png)
   
   When HadoopCatalog runs on Hadoop over COS, this listStatus call can take a long time if the metadata directory in COS contains many files.
   In our case, listing tables takes about 10s.
   
   Could we remove the check (fs.listStatus(metadataPath, TABLE_FILTER).length >= 1), or find a cheaper way to list the files, such as:
   ![image](https://user-images.githubusercontent.com/5066512/102161998-5aeda280-3ec3-11eb-8506-8c76d15c8cdb.png)
   
   
   ![image](https://user-images.githubusercontent.com/5066512/102161957-43161e80-3ec3-11eb-8d7e-59f22cff56c9.png)
   
   
   
   
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] jacques-n commented on issue #1934: remove a list .metadata.json in HadoopCatalog

Posted by GitBox <gi...@apache.org>.
jacques-n commented on issue #1934:
URL: https://github.com/apache/iceberg/issues/1934#issuecomment-745028445


   I'm guessing that FileSystem.listStatusIterator() could be faster, since you only need to check whether there is at least one match, and that doesn't require a full traversal. @txdong-sz, do you want to propose a patch? You'd need to apply the filter outside the call, since I believe that call doesn't support filters.
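
For illustration, here is a minimal sketch of the early-exit idea described above. It is not the HadoopCatalog code: it uses `java.nio.file.DirectoryStream` as a stand-in so that it runs without Hadoop on the classpath. With Hadoop, the loop would instead walk the `RemoteIterator<FileStatus>` returned by `FileSystem.listStatusIterator(Path)` and apply the filter by hand. The class and method names are hypothetical, and the `.metadata.json` suffix is an assumption about what `TABLE_FILTER` matches.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.function.Predicate;

public class LazyMetadataCheck {

  // Hypothetical stand-in for TABLE_FILTER: assumes table metadata
  // files are recognised by the ".metadata.json" suffix.
  public static boolean isMetadataJson(Path path) {
    return path.getFileName().toString().endsWith(".metadata.json");
  }

  // Returns true as soon as one directory entry passes the filter,
  // instead of materialising the full listing and checking its length.
  public static boolean hasMatchingEntry(Path dir, Predicate<Path> filter)
      throws IOException {
    if (!Files.isDirectory(dir)) {
      return false;
    }
    try (DirectoryStream<Path> entries = Files.newDirectoryStream(dir)) {
      for (Path entry : entries) {
        if (filter.test(entry)) {
          return true; // stop at the first match
        }
      }
    }
    return false;
  }
}
```

A call such as `LazyMetadataCheck.hasMatchingEntry(metadataDir, LazyMetadataCheck::isMetadataJson)` would replace the `length >= 1` test. How much this saves depends on where the first matching file falls in the listing order, so the worst case is still a full traversal.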




[GitHub] [iceberg] txdong-sz commented on issue #1934: remove a list .metadata.json in HadoopCatalog

Posted by GitBox <gi...@apache.org>.
txdong-sz commented on issue #1934:
URL: https://github.com/apache/iceberg/issues/1934#issuecomment-745713500


   @jacques-n 
   I also think there is no need to check for metadata.json files at all.
   If a namespace directory contains a metadata folder, we can simply treat it as a table.
   Or else, 
   check whether that metadata folder contains a version-hint.text file.
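
As a rough sketch of the second option above, the listing can be replaced by a single existence probe for `metadata/version-hint.text`. This again uses `java.nio.file` so it runs on its own; with Hadoop the equivalent probe would be `FileSystem.exists(Path)` on the hint file. The class and method names are hypothetical, and this is not how HadoopCatalog is implemented.

```java
import java.nio.file.Files;
import java.nio.file.Path;

public class VersionHintCheck {

  // Treats a directory as a table if metadata/version-hint.text exists.
  // This is one lookup, independent of how many files the folder holds.
  public static boolean looksLikeTable(Path dir) {
    Path hint = dir.resolve("metadata").resolve("version-hint.text");
    return Files.isRegularFile(hint);
  }
}
```

One consequence of this choice is that a table directory whose hint file is absent would no longer be reported as a table by this check.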
   

