Posted to common-issues@hadoop.apache.org by "Janus Chow (Jira)" <ji...@apache.org> on 2019/12/12 04:17:01 UTC
[jira] [Comment Edited] (HADOOP-13144) Enhancing IPC client throughput via multiple connections per user
[ https://issues.apache.org/jira/browse/HADOOP-13144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16994164#comment-16994164 ]
Janus Chow edited comment on HADOOP-13144 at 12/12/19 4:16 AM:
---------------------------------------------------------------
Attached patch named HADOOP-13144.
Our test logic: start 500 threads that call getBlockLocations on one directory containing 100,000 Parquet files, using a random policy to split the load across 2 Routers. The results are as follows.
|HADOOP-13144|inner_patch|processingAvg(ms)|proxyAvg(ms)|rpcProcessingTime(ms)|
|off|off|2.01,1.31|2.55,2.54|4.86,4.33|
|on|off|4,4|0.99,0.75|4.88,5.11|
|on|on|0.023,0.025|1.88,1.92|2.1,2.11|
HADOOP-13144 helps a lot to reduce proxyAvg: with it on (and inner_patch off), proxyAvg drops from about 2.5 ms to under 1 ms.
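The test setup above can be approximated with a small load driver. This is only a minimal sketch, not the harness used for these numbers: the class `LoadSketch`, its `run` method, and the `call` parameter are hypothetical names, and `call` stands in for one getBlockLocations request sent to a randomly chosen Router. It measures client-side latency only, so it does not reproduce the Router-side processingAvg or proxyAvg metrics in the table.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical load driver: N threads each issue the same call repeatedly.
class LoadSketch {
    /**
     * Runs `threads` workers, each invoking `call` `callsPerThread` times.
     * Returns {completedCalls, averageNanosPerCall} as seen by the client.
     */
    static long[] run(int threads, int callsPerThread, Runnable call) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        AtomicLong totalNanos = new AtomicLong();
        AtomicLong totalCalls = new AtomicLong();
        CountDownLatch done = new CountDownLatch(threads);
        for (int t = 0; t < threads; t++) {
            pool.execute(() -> {
                try {
                    for (int i = 0; i < callsPerThread; i++) {
                        long start = System.nanoTime();
                        call.run(); // assumption: one RPC, e.g. getBlockLocations
                        totalNanos.addAndGet(System.nanoTime() - start);
                        totalCalls.incrementAndGet();
                    }
                } finally {
                    done.countDown(); // always release the latch, even if call throws
                }
            });
        }
        try {
            done.await(); // wait until every worker has finished
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for workers", e);
        } finally {
            pool.shutdown();
        }
        long calls = totalCalls.get();
        long avgNanos = calls == 0 ? 0 : totalNanos.get() / calls;
        return new long[] { calls, avgNanos };
    }
}
```

For a run shaped like the one described here, `threads` would be 500 and `call` would wrap the real client request.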
was (Author: symious):
Attached path named HADOOP-13144.
> Enhancing IPC client throughput via multiple connections per user
> -----------------------------------------------------------------
>
> Key: HADOOP-13144
> URL: https://issues.apache.org/jira/browse/HADOOP-13144
> Project: Hadoop Common
> Issue Type: Improvement
> Components: ipc
> Reporter: Jason Kace
> Assignee: Íñigo Goiri
> Priority: Minor
> Attachments: HADOOP-13144-performance.patch, HADOOP-13144.000.patch, HADOOP-13144.001.patch, HADOOP-13144.002.patch, HADOOP-13144.003.patch
>
>
> The generic IPC client ({{org.apache.hadoop.ipc.Client}}) utilizes a single connection thread for each {{ConnectionId}}. The {{ConnectionId}} is unique to the connection's remote address, ticket and protocol. Each {{ConnectionId}} is 1:1 mapped to a connection thread by the client via a map cache.
> As a result, all IPC read/write activity for each user/ticket + address is serialized through a single thread. If a single user makes repeated calls (1k-100k/sec) to the same destination, the IPC client becomes a bottleneck.
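The 1:1 mapping described above, and the idea in the issue title of allowing several connections per user, can be illustrated with a small model. This is a minimal sketch under stated assumptions, not the code in the attached patches or in {{org.apache.hadoop.ipc.Client}}: `ConnectionPoolSketch`, `Key`, `Connection`, and the `slot` field are hypothetical names. The extra `slot` index is one possible way to let the same address, ticket and protocol map to more than one cached connection.

```java
import java.util.Objects;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical model of a connection cache keyed by (address, ticket, protocol, slot).
class ConnectionPoolSketch {
    /** Stand-in for ConnectionId, extended with an assumed slot index. */
    static final class Key {
        final String address;
        final String ticket;
        final String protocol;
        final int slot;

        Key(String address, String ticket, String protocol, int slot) {
            this.address = address;
            this.ticket = ticket;
            this.protocol = protocol;
            this.slot = slot;
        }

        @Override
        public boolean equals(Object o) {
            if (!(o instanceof Key)) {
                return false;
            }
            Key k = (Key) o;
            return slot == k.slot && address.equals(k.address)
                && ticket.equals(k.ticket) && protocol.equals(k.protocol);
        }

        @Override
        public int hashCode() {
            return Objects.hash(address, ticket, protocol, slot);
        }
    }

    /** Stand-in for a connection thread; a real one would own a socket. */
    static final class Connection {
        final Key key;

        Connection(Key key) {
            this.key = key;
        }
    }

    private final int connectionsPerUser;
    private final ConcurrentHashMap<Key, Connection> cache = new ConcurrentHashMap<>();
    // Round-robin counter shared by all callers (a simplification).
    private final AtomicInteger next = new AtomicInteger();

    ConnectionPoolSketch(int connectionsPerUser) {
        if (connectionsPerUser < 1) {
            throw new IllegalArgumentException("connectionsPerUser must be >= 1");
        }
        this.connectionsPerUser = connectionsPerUser;
    }

    /** With connectionsPerUser == 1 this reduces to the 1:1 mapping described above. */
    Connection getConnection(String address, String ticket, String protocol) {
        int slot = Math.floorMod(next.getAndIncrement(), connectionsPerUser);
        return cache.computeIfAbsent(new Key(address, ticket, protocol, slot), Connection::new);
    }

    int cachedConnections() {
        return cache.size();
    }
}
```

With a pool size of 1, every call from one user to one destination shares a single cached connection, which is the bottleneck described here. With a larger pool size, the same calls are spread round-robin over several connections.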