You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@hive.apache.org by Ryan LeCompte <le...@gmail.com> on 2010/03/02 21:56:53 UTC

Loading data into Hive with the partition column defined in the data

Hello all,

I know that you can load data into Hive via:

LOAD DATA LOCAL INPATH '/my/path/to/data.csv' OVERWRITE INTO TABLE my_table
PARTITION(dt='XYZ');

... where dt=XYZ is the partition where the data should be stored.

However, what if data.csv actually had the dt column specified? Is there a
way to get Hive to correctly insert each line of data.csv into my_table
under the right partition with one LOAD command?

Thanks,
Ryan

Re: Loading data into Hive with the partition column defined in the data

Posted by Ryan LeCompte <le...@gmail.com>.

Thanks!


On Tue, Mar 2, 2010 at 4:05 PM, Ning Zhang <nz...@facebook.com> wrote:

> It is not supported yet but I am working on it (see jira hive-936)
>
> Thanks,
> Ning
> ------
> Sent from my blackberry
>
> ------------------------------
>  *From*: Ryan LeCompte <le...@gmail.com>
> *To*: hive-user@hadoop.apache.org <hi...@hadoop.apache.org>
> *Sent*: Tue Mar 02 12:56:53 2010
> *Subject*: Loading data into Hive with the partition column defined in the
> data
>
> Hello all,
>
> I know that you can load data into Hive via:
>
> LOAD DATA LOCAL INPATH '/my/path/to/data.csv' OVERWRITE INTO TABLE my_table
> PARTITION(dt='XYZ');
>
> ... where dt=XYZ is the partition where the data should be stored.
>
> However, what if data.csv actually had the dt column specified? Is there a
> way to get Hive to correctly insert each line of data.csv into my_table
> under the right partition with one LOAD command?
>
> Thanks,
> Ryan
>
>
>
>
>

Re: Loading data into Hive with the partition column defined in the data

Posted by Ning Zhang <nz...@facebook.com>.

It is not supported yet but I am working on it (see jira hive-936)

Thanks,
Ning
------
Sent from my blackberry

________________________________
From: Ryan LeCompte <le...@gmail.com>
To: hive-user@hadoop.apache.org <hi...@hadoop.apache.org>
Sent: Tue Mar 02 12:56:53 2010
Subject: Loading data into Hive with the partition column defined in the data

Hello all,

I know that you can load data into Hive via:

LOAD DATA LOCAL INPATH '/my/path/to/data.csv' OVERWRITE INTO TABLE my_table PARTITION(dt='XYZ');

... where dt=XYZ is the partition where the data should be stored.

However, what if data.csv actually had the dt column specified? Is there a way to get Hive to correctly insert each line of data.csv into my_table under the right partition with one LOAD command?

Thanks,
Ryan