You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by "waltczhang (via GitHub)" <gi...@apache.org> on 2023/05/12 08:40:48 UTC

[GitHub] [iceberg] waltczhang opened a new issue, #7594: planFiles with ParallelIterator OOM(Out of memory)

waltczhang opened a new issue, #7594:
URL: https://github.com/apache/iceberg/issues/7594

   ### Query engine
   
   _No response_
   
   ### Question
   
   Background:
   When I was doing a full table scan (table DataFiles with millions or even more records), I found that using ParallelIterator often leads to OOM (out of memory) issues. The main reason is that the ParallelIterator queue model has a many-to-one relationship, meaning that there is at least one producer but only one consumer. Is there a need for some mechanism to control the speed or capacity of the generated data in this case?
   
   For example:
   `CloseableIterable<FileScanTask> fileScanTasks = table.newScan()..planFiles();
     fileScanTasks.forEach(...);`
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


Re: [I] planFiles with ParallelIterator OOM(Out of memory) [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] closed issue #7594: planFiles with ParallelIterator OOM(Out of memory)
URL: https://github.com/apache/iceberg/issues/7594


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] dramaticlly commented on issue #7594: planFiles with ParallelIterator OOM(Out of memory)

Posted by "dramaticlly (via GitHub)" <gi...@apache.org>.
dramaticlly commented on issue #7594:
URL: https://github.com/apache/iceberg/issues/7594#issuecomment-1545922875

   similar issue reported before in https://github.com/apache/iceberg/issues/4822


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


Re: [I] planFiles with ParallelIterator OOM(Out of memory) [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on issue #7594:
URL: https://github.com/apache/iceberg/issues/7594#issuecomment-1823681742

   This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


Re: [I] planFiles with ParallelIterator OOM(Out of memory) [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on issue #7594:
URL: https://github.com/apache/iceberg/issues/7594#issuecomment-1802943749

   This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org