Posted to user@hive.apache.org by David Zonsheine <da...@gmail.com> on 2011/03/30 15:30:50 UTC

insert - Hadoop vs. Hive

Hi,

I'm trying to compare two ways of adding files to HDFS for use with Hive:
using Hive inserts vs. adding the files to HDFS directly and then using Hive.

Any comments or blog posts about this?

Thanks a lot,

David Zonsheine

Re: insert - Hadoop vs. Hive

Posted by Ashish Thusoo <at...@fb.com>.
If the data is already in the right format, you should use the LOAD syntax in Hive. This basically moves the files into HDFS, so it should be no less performant than adding them to HDFS directly. If the data is not in the correct format, or it needs to be transformed, then the INSERT statement needs to be used.
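
As a rough sketch of the two paths in HiveQL (the table names, columns, and file paths here are hypothetical, made up for illustration only):

```sql
-- Hypothetical tables; names, columns, and paths are assumptions.
CREATE TABLE page_views (user_id STRING, url STRING, view_time STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
STORED AS TEXTFILE;

CREATE TABLE page_views_clean (user_id STRING, url STRING, view_time STRING)
STORED AS SEQUENCEFILE;

-- Case 1: the file already matches the table's format and is already in HDFS.
-- LOAD moves the file into the table's directory; no job is run.
LOAD DATA INPATH '/staging/page_views/2011-03-30.tsv' INTO TABLE page_views;

-- Same case, but the file is on the local filesystem.
-- LOCAL makes Hive copy the file into HDFS instead of moving it.
LOAD DATA LOCAL INPATH '/tmp/page_views.tsv' INTO TABLE page_views;

-- Case 2: the data has to be filtered, transformed, or rewritten in another format.
-- INSERT ... SELECT runs a query and writes the result in the target table's format.
INSERT OVERWRITE TABLE page_views_clean
SELECT user_id, lower(url), view_time
FROM page_views
WHERE user_id IS NOT NULL;
```

Note that LOAD does not parse or check the file contents, so a file that does not match the table's declared format will only show problems at query time.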

Ashish