You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Sameera Wijerathne (Jira)" <ji...@apache.org> on 2022/06/28 08:33:00 UTC
[jira] [Created] (IMPALA-11400) Kudu scan bottleneck due to sharing a single Kudu client for multiple tablet scans
| ![](cid:jira-generated-image-avatar-e34c54fa-c758-4e7d-af1a-15e75193e254) |
[Sameera
Wijerathne](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=sameerawi)
**created** an issue
---|---
|
---
| [IMPALA](https://issues.apache.org/jira/browse/IMPALA) / [![Bug](cid:jira-
generated-image-
avatar-096a3c00-58d7-4cbf-8f83-776baf5606e9)](https://issues.apache.org/jira/browse/IMPALA-11400)
[IMPALA-11400](https://issues.apache.org/jira/browse/IMPALA-11400)
---
[Kudu scan bottleneck due to sharing a single Kudu client for multiple tablet
scans](https://issues.apache.org/jira/browse/IMPALA-11400)
| Issue Type: | ![Bug](cid:jira-generated-image-
avatar-096a3c00-58d7-4cbf-8f83-776baf5606e9) Bug
---|---
Affects Versions: | Impala 4.1.0
Assignee: | Unassigned
Attachments: | 0.JPG, 1.JPG, 2.JPG, 2.jpeg, 3.JPG, 4.JPG, 5.JPG,
Impala_1.png, Impala_2.png, Kudu_1.png, Kudu_2.png, WhatsApp Image 2022-06-07
at 10.39.27 PM.jpeg
Components: | Backend
Created: | 28/Jun/22 08:32
Fix Versions: | Impala 4.2.0
Labels: | performance
Priority: | ![Major](cid:jira-generated-image-static-
major-d8a89c57-cd9d-4894-bd00-8ec93433c775) Major
Reporter: | [Sameera
Wijerathne](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=sameerawi)
|
This issue was observed when impala queries large datasets resides in Kudu.
Even single ImpalaD is scanning multiple kudu tablets, it shows a slowness to
retrive data eventhough ImpalaD makes parrellel scans. Reason for this is
ImpalaD only uses a single Kudu client for multiple scans but
KuduScanner::NextBatch runs on a single thread. So it's rpc reactor thread
utilizes upto a single core and bottlenecks all parrelel scans.
This behaviour makes Impala clusters that scans kudu cannot be vertically
scales to the maximum performance/cores of a node.
Please refer the screenshots from Kudu slack channel for more information.
---
| | [ ![Add Comment](cid:jira-generated-image-static-comment-
icon-511b0294-df23-42dd-803e-65d4a465e594)
](https://issues.apache.org/jira/browse/IMPALA-11400#add-comment "Add
Comment") | [Add
Comment](https://issues.apache.org/jira/browse/IMPALA-11400#add-comment "Add
Comment")
---|---
| This message was sent by Atlassian Jira (v8.20.10#820010-sha1:ace47f9) | |
![Atlassian logo](https://issues.apache.org/jira/images/mail/atlassian-email-
logo.png)
---