You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by Vidhya Venkataraman <vi...@dropbox.com> on 2012/08/07 22:26:50 UTC

A possibly trivial join question

Is there a difference between the following two join sub queries in Hive?

A JOIN B ON (A.DUH1=B.DUH2) JOIN C ON (B.DUH2=C.DUH3)

vs.

A JOIN B JOIN C ON (A.DUH1=B.DUH2) AND (B.DUH2=C.DUH3)

?

And is it still necessary to keep the larger table at the very end? What if
tables B and C are selection queries that obtain a very small set from
table A?

Thanks
Vidhya