Posted to issues@impala.apache.org by "Acharki Mohammed (JIRA)" <ji...@apache.org> on 2017/10/21 17:33:03 UTC

[jira] [Created] (IMPALA-6097) Impala ODBC/JDBC bad performance - rows fetch is very slow from a remote server compared with NN

Acharki Mohammed created IMPALA-6097:
----------------------------------------

             Summary: Impala ODBC/JDBC bad performance - rows fetch is very slow from a remote server compared with NN 
                 Key: IMPALA-6097
                 URL: https://issues.apache.org/jira/browse/IMPALA-6097
             Project: IMPALA
          Issue Type: Bug
          Components: Distributed Exec, Frontend
    Affects Versions: Impala 2.9.0
         Environment: CDH 5.12.0
            Reporter: Acharki Mohammed


Hi,

On the NameNode (NN), when I run the query through an ODBC script (PHP, Perl, or Python), I can fetch all results (9.2M) into a variable in about 30 seconds. When I ran the same script and query on two other remote servers, the execution time was 28 min on the first server and 17 min on the second.

To rule out a network speed issue, I wrote the result to a file and copied it with the scp command to the first remote server; the copy finished in ~*0 seconds.

What I observe in the query info in CM is that there is a big difference between the *Threads: Network Send Wait Time* values for the 3 queries:

in NN query: 9.40s
in 1st remote server: 16.7m
in 2nd remote server: 26.8m
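
To separate query execution time from row-fetch time on each client, the fetch loop can be timed on its own. The following is only a minimal sketch, assuming a Python DB-API 2.0 cursor (for example one obtained from an ODBC connection); the function name timed_fetch and the batch size are hypothetical and not taken from the original script.

```python
import time


def timed_fetch(cursor, query, batch_size=10000):
    """Run `query` on a DB-API 2.0 cursor and fetch all rows in batches.

    Returns (row_count, execute_seconds, fetch_seconds) so that time spent
    executing the query can be compared with time spent pulling rows.
    `batch_size` is an illustrative value, not a recommended setting.
    """
    start = time.perf_counter()
    cursor.execute(query)
    executed = time.perf_counter()

    row_count = 0
    while True:
        # fetchmany returns an empty sequence once the result set is exhausted
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        row_count += len(batch)
    fetched = time.perf_counter()

    return row_count, executed - start, fetched - executed
```

Running the same function with the same query on the NN and on each remote server would show whether the extra minutes are spent in the fetch loop rather than in query execution.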

I also tried a Java program with Impala *JDBC*, but the results on the NN were already not encouraging enough to continue.

So far I can't find where the problem is or how to resolve it.

NB: I'm working on CDH 5.12.0/Impala 2.9.0, and I installed Impala ODBC 2.5.37.1014.
 

I hope you can respond ASAP, because this issue is really an obstacle to using a cluster that took us several months to build.
Thanks in advance.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)