You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by Jay Ramadorai <jr...@tripadvisor.com> on 2011/03/08 00:01:23 UTC

Loading data into a Clustered/bucketed table

I am Sqooping data from an external source into a bucketed Hive table. Sqoop seems completely bucket-unaware, it simply used LOAD INPATH which moves the single file containing Sqooped data into the Hive warehouse location.

My question: 
- is there any way to get data into an empty clustered/bucketed table other than to first get the data into a temp table and the INSERT OVERWRITE SELECT from it? 
- Can INSERT OVERWRITE take its input from STDIN or a file instead of a Hive table?

thanks
Jay