You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by Awhan Patnaik <aw...@spotzot.com> on 2016/01/07 11:40:35 UTC
query execution time in hive
Hello all
With respect to command line hive shell, is the query execution time
reported by hive the total time elapsed since the issue of the query or
the actual time spent in the query itself?
I ask this because of the following observation. In general when I issue a
query, very soon I start to see map and reduce progress. This way I know my
query is running. Often times I see that no progress is shown. I am
assuming that the query is in ACCEPTED state as this point. But if the time
reported by hive includes this waiting time then I do not get an accurate
estimate of the query execution time. This is not helpful for query
performance profiling.
So the question is this. How to I get the query execution time for the
actual mapreduce processes without baby sitting the query?
Regards,
Awhan
Re: query execution time in hive
Posted by Awhan Patnaik <aw...@spotzot.com>.
Thanks.
On Thu, Jan 7, 2016 at 4:17 PM, Mich Talebzadeh <mi...@peridale.co.uk> wrote:
> As far as I can see the easiest option is to use
>
>
>
> select from_unixtime(unix_timestamp(), 'dd/MM/yyyy HH:mm:ss.ss') AS
> StartTime;
>
>
>
> .. your query here
>
>
>
> select from_unixtime(unix_timestamp(), 'dd/MM/yyyy HH:mm:ss.ss') AS
> EndTime;
>
>
>
>
>
> And delta will be the overall time
>
>
>
> HTH
>
>
>
> Dr Mich Talebzadeh
>
>
>
> LinkedIn * https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
> <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*
>
>
>
> *Sybase ASE 15 Gold Medal Award 2008*
>
> A Winning Strategy: Running the most Critical Financial Data on ASE 15
>
>
> http://login.sybase.com/files/Product_Overviews/ASE-Winning-Strategy-091908.pdf
>
> Author of the books* "A Practitioner’s Guide to Upgrading to Sybase ASE
> 15", ISBN 978-0-9563693-0-7*.
>
> co-author *"Sybase Transact SQL Guidelines Best Practices", ISBN
> 978-0-9759693-0-4*
>
> *Publications due shortly:*
>
> *Complex Event Processing in Heterogeneous Environments*, ISBN:
> 978-0-9563693-3-8
>
> *Oracle and Sybase, Concepts and Contrasts*, ISBN: 978-0-9563693-1-4, volume
> one out shortly
>
>
>
> http://talebzadehmich.wordpress.com
>
>
>
> NOTE: The information in this email is proprietary and confidential. This
> message is for the designated recipient only, if you are not the intended
> recipient, you should destroy it immediately. Any information in this
> message shall not be understood as given or endorsed by Peridale Technology
> Ltd, its subsidiaries or their employees, unless expressly so stated. It is
> the responsibility of the recipient to ensure that this email is virus
> free, therefore neither Peridale Ltd, its subsidiaries nor their employees
> accept any responsibility.
>
>
>
> *From:* Awhan Patnaik [mailto:awhan@spotzot.com]
> *Sent:* 07 January 2016 10:41
> *To:* user@hive.apache.org
> *Subject:* query execution time in hive
>
>
>
> Hello all
>
> With respect to command line hive shell, is the query execution time
> reported by hive the total time elapsed since the issue of the query or
> the actual time spent in the query itself?
>
> I ask this because of the following observation. In general when I issue a
> query, very soon I start to see map and reduce progress. This way I know my
> query is running. Often times I see that no progress is shown. I am
> assuming that the query is in ACCEPTED state as this point. But if the time
> reported by hive includes this waiting time then I do not get an accurate
> estimate of the query execution time. This is not helpful for query
> performance profiling.
>
> So the question is this. How to I get the query execution time for the
> actual mapreduce processes without baby sitting the query?
>
> Regards,
>
> Awhan
>
RE: query execution time in hive
Posted by Mich Talebzadeh <mi...@peridale.co.uk>.
As far as I can see the easiest option is to use
select from_unixtime(unix_timestamp(), 'dd/MM/yyyy HH:mm:ss.ss') AS StartTime;
.. your query here
select from_unixtime(unix_timestamp(), 'dd/MM/yyyy HH:mm:ss.ss') AS EndTime;
And delta will be the overall time
HTH
Dr Mich Talebzadeh
LinkedIn <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
Sybase ASE 15 Gold Medal Award 2008
A Winning Strategy: Running the most Critical Financial Data on ASE 15
<http://login.sybase.com/files/Product_Overviews/ASE-Winning-Strategy-091908.pdf> http://login.sybase.com/files/Product_Overviews/ASE-Winning-Strategy-091908.pdf
Author of the books "A Practitioner’s Guide to Upgrading to Sybase ASE 15", ISBN 978-0-9563693-0-7.
co-author "Sybase Transact SQL Guidelines Best Practices", ISBN 978-0-9759693-0-4
Publications due shortly:
Complex Event Processing in Heterogeneous Environments, ISBN: 978-0-9563693-3-8
Oracle and Sybase, Concepts and Contrasts, ISBN: 978-0-9563693-1-4, volume one out shortly
<http://talebzadehmich.wordpress.com/> http://talebzadehmich.wordpress.com
NOTE: The information in this email is proprietary and confidential. This message is for the designated recipient only, if you are not the intended recipient, you should destroy it immediately. Any information in this message shall not be understood as given or endorsed by Peridale Technology Ltd, its subsidiaries or their employees, unless expressly so stated. It is the responsibility of the recipient to ensure that this email is virus free, therefore neither Peridale Ltd, its subsidiaries nor their employees accept any responsibility.
From: Awhan Patnaik [mailto:awhan@spotzot.com]
Sent: 07 January 2016 10:41
To: user@hive.apache.org
Subject: query execution time in hive
Hello all
With respect to command line hive shell, is the query execution time reported by hive the total time elapsed since the issue of the query or the actual time spent in the query itself?
I ask this because of the following observation. In general when I issue a query, very soon I start to see map and reduce progress. This way I know my query is running. Often times I see that no progress is shown. I am assuming that the query is in ACCEPTED state as this point. But if the time reported by hive includes this waiting time then I do not get an accurate estimate of the query execution time. This is not helpful for query performance profiling.
So the question is this. How to I get the query execution time for the actual mapreduce processes without baby sitting the query?
Regards,
Awhan