You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Acharki Mohammed (JIRA)" <ji...@apache.org> on 2017/10/21 17:33:03 UTC
[jira] [Created] (IMPALA-6097) Impala ODBC/JDBC bad performance -
rows fetch is very slow from a remote server compared with NN
Acharki Mohammed created IMPALA-6097:
----------------------------------------
Summary: Impala ODBC/JDBC bad performance - rows fetch is very slow from a remote server compared with NN
Key: IMPALA-6097
URL: https://issues.apache.org/jira/browse/IMPALA-6097
Project: IMPALA
Issue Type: Bug
Components: Distributed Exec, Frontend
Affects Versions: Impala 2.9.0
Environment: CDH 5.12.0
Reporter: Acharki Mohammed
Hi,
In NameNode when I run the query via odbc script (php/perl or python), I can fetchAll results (9.2M) in a variable in about 30 seconds, but when I tried with the same script/query on another remote 2 servers, the execution time was in first server 28 min and in second 17 min.
To exclude the assuming that it's a network speed issue, I fetch the result on a file and then I copied it with scp command to the first remote server, and it finished in ~*0 seconds.
What I observe in Query info in CM that there is a big different between the *Threads: Network Send Wait Time* values in the 3 queries :
in NN query: 9.40s
in 1st remote server: 16.7m
in 2nd remote server: 26.8m
And also I try an java script with Impala *JDBC*, but the results in NN are already not stimulate to continue.
But so far I cann't find where is the problem and how can I resolve it.
NB: I'm working on CDH 5.12.0/Impala 2.9.0, I installed Impala ODBC 2.5.37.1014.
Hope you can respode to me ASAP, because this issue is a realy an obstacle of using a cluster that we made a several months to make it.
Thanks in advance.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)