You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/11/30 09:21:53 UTC

[GitHub] [iceberg] camper42 opened a new issue, #6318: executor logs ton of `INFO CodecPool: Got brand-new decompressor [.zstd]`

camper42 opened a new issue, #6318:
URL: https://github.com/apache/iceberg/issues/6318

   ### Apache Iceberg version
   
   1.1.0 (latest release)
   
   ### Query engine
   
   Spark
   
   ### Please describe the bug 🐞
   
   call rewrite_data_files on a zstd+parquet table (feed by stream)
   
   executor logs too many `INFO CodecPool: Got brand-new decompressor [.zstd]` and I have to set log level to `WARN` or my pod log will fill up ephemeral storage
   
   Is this a side effect of #5681 ? 
   
   ![image](https://user-images.githubusercontent.com/3177898/204755406-0a75af51-5f98-409d-bfd5-4528017fe598.png)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] nastra commented on issue #6318: executor logs ton of `INFO CodecPool: Got brand-new decompressor [.zstd]`

Posted by GitBox <gi...@apache.org>.
nastra commented on issue #6318:
URL: https://github.com/apache/iceberg/issues/6318#issuecomment-1333600955

   I've looked at the code and this happened even before #5681. The decompressors are actually being re-used, but not across different Parquet Files. So this means in your case you have lots of Parquet files that are being read, and thus you're seeing this log output from Hadoop's `CodecPool`


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] camper42 closed issue #6318: executor logs ton of `INFO CodecPool: Got brand-new decompressor [.zstd]`

Posted by GitBox <gi...@apache.org>.
camper42 closed issue #6318: executor logs ton of `INFO CodecPool: Got brand-new decompressor [.zstd]`
URL: https://github.com/apache/iceberg/issues/6318


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] camper42 commented on issue #6318: executor logs ton of `INFO CodecPool: Got brand-new decompressor [.zstd]`

Posted by GitBox <gi...@apache.org>.
camper42 commented on issue #6318:
URL: https://github.com/apache/iceberg/issues/6318#issuecomment-1333612142

   thx, so for my scenario, I may need to compact more promptly and/or set the log level of the `CodecPool` to `WARN`


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org