You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Weston Pace (Jira)" <ji...@apache.org> on 2021/12/08 22:55:00 UTC
[jira] [Comment Edited] (ARROW-15036) [C++] Expose S3 SDK configuration parameter "maxConnections"
[ https://issues.apache.org/jira/browse/ARROW-15036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456047#comment-17456047 ]
Weston Pace edited comment on ARROW-15036 at 12/8/21, 10:54 PM:
----------------------------------------------------------------
Actually, I think we could take two different routes here.
Option 1: Yet another S3Options config variable
- Pro: More flexible
- Con: Requires the user to set it correctly and line it up with the I/O thread pool size
Option 2: Set maxConnections to io_context_.executor()->GetCapacity()
- Pro: Automatic configuration, don't need to burden the user with details
- Con: May do the wrong thing if the user resizes the pool after creating the filesystem
- Con: If the user is using S3Options::background_writes they may want more connections than I/O pool threads
[~apitrou] Any opinion or suggestion for a different approach?
was (Author: westonpace):
Actually, I think we could take two different routes here.
Option 1: Yet another S3Options config variable
- Pros: More flexible
- Cons: Requires the user to set it correctly and line it up with the I/O thread pool size
Option 2: Set maxConnections to io_context_.executor()->GetCapacity()
- Pros: Automatic configuration, don't need to burden the user with details
- Cons: May do the wrong thing if the user resizes the pool after creating the filesystem
- If the user is using S3Options::background_writes they may want more connections than I/O pool threads
[~apitrou] Any opinion or suggestion for a different approach?
> [C++] Expose S3 SDK configuration parameter "maxConnections"
> ------------------------------------------------------------
>
> Key: ARROW-15036
> URL: https://issues.apache.org/jira/browse/ARROW-15036
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Weston Pace
> Priority: Major
>
> This is primarily inspired by ARROW-14965 where it seems that the default (25) is limiting S3 filesystem performance.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)