You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by Dima Fadeyev <df...@pragsis.com> on 2014/09/25 14:54:12 UTC

query with FIST_VALUE/LAST_VALUE functions keep running forever

Hello everyone,

I'm trying to run a query on an 8 node cluster with hive-0.13 (MapR 3.1.1):

SELECT FIRST_VALUE(col_a) OVER (PARTITION BY col_b ORDER BY col_c) FROM 
test;

If any partition is over 30000 rows, the reduce phase of my query keeps 
running forever (until job is being killed by JobTracker).

Is this normal behavior? A normal ORDER BY on a table of 7 millions of 
rows takes about 70 seconds to complete on the same cluster.

Thanks and best regards,
-- 

AVISO CONFIDENCIAL\nEste correo y la información contenida o adjunta al mismo es privada y confidencial y va dirigida exclusivamente a su destinatario. Pragsis informa a quien pueda haber recibido este correo por error que contiene información confidencial cuyo uso, copia, reproducción o distribución está expresamente prohibida. Si no es Vd. el destinatario del mismo y recibe este correo por error, le rogamos lo ponga en conocimiento del emisor y proceda a su eliminación sin copiarlo, imprimirlo o utilizarlo de ningún modo.\nCONFIDENTIALITY WARNING.\nThis message and the information contained in or attached to it are private and confidential and intended exclusively for the addressee. Pragsis informs to whom it may receive it in error that it contains privileged information and its use, copy, reproduction or distribution is prohibited. If you are not an intended recipient of this E-mail, please notify the sender, delete it and do not read, act upon, print, disclose, copy, reta
 in or redistribute any portion of this E-mail.