You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tajo.apache.org by "Hyunsik Choi (JIRA)" <ji...@apache.org> on 2014/07/28 07:27:38 UTC
[jira] [Created] (TAJO-982) Improve Fetcher to get multiple shuffle
outputs through a request
Hyunsik Choi created TAJO-982:
---------------------------------
Summary: Improve Fetcher to get multiple shuffle outputs through a request
Key: TAJO-982
URL: https://issues.apache.org/jira/browse/TAJO-982
Project: Tajo
Issue Type: Bug
Components: data shuffle
Reporter: Hyunsik Choi
Fix For: 0.9.0
Currently, Fetcher only can request at most a fetch for one shuffle output at a time. The implementation can cause performance degradation even though intermediate data is actually small.
For example, If an input data set of the first stage is big and the intermediate data is very small, QueryMaster will choose a few of nodes for next execution block. Since each node keeps limited threads for fetch, it will take a long time for the nodes for next stage to fetch all intermediate.
If Fetcher can get multiple shuffle outputs through a request, it would solve the slowness which occurs in some cases.
--
This message was sent by Atlassian JIRA
(v6.2#6252)