You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by CHEBARO Abdallah <Ab...@murex.com> on 2014/08/12 14:21:28 UTC

Distributed data

Hello,

Using Hive, we know that we should specify the file path to read data from a specific location. If the data is distributed on many computers, how can we read it?

Thanks
*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.

RE: Distributed data

Posted by CHEBARO Abdallah <Ab...@murex.com>.
Hello,

Please explain to me : “If your hadoop is setup with same filesystem as hdfs, hive will take care of it “

From: Nitin Pawar [mailto:nitinpawar432@gmail.com]
Sent: Tuesday, August 12, 2014 3:50 PM
To: user@hive.apache.org
Subject: Re: Distributed data

If your hadoop is setup with same filesystem as hdfs, hive will take care of it

If your hdfs is totally different than where the file resides, then you need to get the file from that filesystem and then push it to hive using load

if that filesystem supports import/export with tools like sqoop then you can use them as well



On Tue, Aug 12, 2014 at 5:58 PM, CHEBARO Abdallah <Ab...@murex.com>> wrote:
Yes I mean the data is on hdfs like filesystem

From: Nitin Pawar [mailto:nitinpawar432@gmail.com<ma...@gmail.com>]
Sent: Tuesday, August 12, 2014 3:26 PM
To: user@hive.apache.org<ma...@hive.apache.org>
Subject: Re: Distributed data

what do you mean the data is distributed on many computers?

are you saying the data is on hdfs like filesystem ?

On Tue, Aug 12, 2014 at 5:51 PM, CHEBARO Abdallah <Ab...@murex.com>> wrote:
Hello,

Using Hive, we know that we should specify the file path to read data from a specific location. If the data is distributed on many computers, how can we read it?

Thanks

*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.



--
Nitin Pawar


*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.



--
Nitin Pawar
*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.

RE: Distributed data

Posted by CHEBARO Abdallah <Ab...@murex.com>.
First of all, thank you, the information is very helpful.

Can you please provide me more details about “If your hadoop is setup with same filesystem as hdfs, hive will take care of it “ ?

Thanks

From: Nitin Pawar [mailto:nitinpawar432@gmail.com]
Sent: Tuesday, August 12, 2014 3:50 PM
To: user@hive.apache.org
Subject: Re: Distributed data

If your hadoop is setup with same filesystem as hdfs, hive will take care of it

If your hdfs is totally different than where the file resides, then you need to get the file from that filesystem and then push it to hive using load

if that filesystem supports import/export with tools like sqoop then you can use them as well



On Tue, Aug 12, 2014 at 5:58 PM, CHEBARO Abdallah <Ab...@murex.com>> wrote:
Yes I mean the data is on hdfs like filesystem

From: Nitin Pawar [mailto:nitinpawar432@gmail.com<ma...@gmail.com>]
Sent: Tuesday, August 12, 2014 3:26 PM
To: user@hive.apache.org<ma...@hive.apache.org>
Subject: Re: Distributed data

what do you mean the data is distributed on many computers?

are you saying the data is on hdfs like filesystem ?

On Tue, Aug 12, 2014 at 5:51 PM, CHEBARO Abdallah <Ab...@murex.com>> wrote:
Hello,

Using Hive, we know that we should specify the file path to read data from a specific location. If the data is distributed on many computers, how can we read it?

Thanks

*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.



--
Nitin Pawar


*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.



--
Nitin Pawar
*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.

Re: Distributed data

Posted by Nitin Pawar <ni...@gmail.com>.
If your hadoop is setup with same filesystem as hdfs, hive will take care
of it

If your hdfs is totally different than where the file resides, then you
need to get the file from that filesystem and then push it to hive using
load

if that filesystem supports import/export with tools like sqoop then you
can use them as well




On Tue, Aug 12, 2014 at 5:58 PM, CHEBARO Abdallah <
Abdallah.CHEBARO@murex.com> wrote:

>  Yes I mean the data is on hdfs like filesystem
>
>
>
> *From:* Nitin Pawar [mailto:nitinpawar432@gmail.com]
> *Sent:* Tuesday, August 12, 2014 3:26 PM
> *To:* user@hive.apache.org
> *Subject:* Re: Distributed data
>
>
>
> what do you mean the data is distributed on many computers?
>
>
>
> are you saying the data is on hdfs like filesystem ?
>
>
>
> On Tue, Aug 12, 2014 at 5:51 PM, CHEBARO Abdallah <
> Abdallah.CHEBARO@murex.com> wrote:
>
> Hello,
>
>
>
> Using Hive, we know that we should specify the file path to read data from
> a specific location. If the data is distributed on many computers, how can
> we read it?
>
>
>
> Thanks
>
> *******************************
>
> This e-mail contains information for the intended recipient only. It may
> contain proprietary material or confidential information. If you are not
> the intended recipient you are not authorised to distribute, copy or use
> this e-mail or any attachment to it. Murex cannot guarantee that it is
> virus free and accepts no responsibility for any loss or damage arising
> from its use. If you have received this e-mail in error please notify
> immediately the sender and delete the original email received, any
> attachments and all copies from your system.
>
>
>
>
>
> --
> Nitin Pawar
>
>
>
> *******************************
>
> This e-mail contains information for the intended recipient only. It may
> contain proprietary material or confidential information. If you are not
> the intended recipient you are not authorised to distribute, copy or use
> this e-mail or any attachment to it. Murex cannot guarantee that it is
> virus free and accepts no responsibility for any loss or damage arising
> from its use. If you have received this e-mail in error please notify
> immediately the sender and delete the original email received, any
> attachments and all copies from your system.
>



-- 
Nitin Pawar

RE: Distributed data

Posted by CHEBARO Abdallah <Ab...@murex.com>.
Yes I mean the data is on hdfs like filesystem

From: Nitin Pawar [mailto:nitinpawar432@gmail.com]
Sent: Tuesday, August 12, 2014 3:26 PM
To: user@hive.apache.org
Subject: Re: Distributed data

what do you mean the data is distributed on many computers?

are you saying the data is on hdfs like filesystem ?

On Tue, Aug 12, 2014 at 5:51 PM, CHEBARO Abdallah <Ab...@murex.com>> wrote:
Hello,

Using Hive, we know that we should specify the file path to read data from a specific location. If the data is distributed on many computers, how can we read it?

Thanks

*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.



--
Nitin Pawar

*******************************

This e-mail contains information for the intended recipient only. It may contain proprietary material or confidential information. If you are not the intended recipient you are not authorised to distribute, copy or use this e-mail or any attachment to it. Murex cannot guarantee that it is virus free and accepts no responsibility for any loss or damage arising from its use. If you have received this e-mail in error please notify immediately the sender and delete the original email received, any attachments and all copies from your system.

Re: Distributed data

Posted by Nitin Pawar <ni...@gmail.com>.
what do you mean the data is distributed on many computers?

are you saying the data is on hdfs like filesystem ?


On Tue, Aug 12, 2014 at 5:51 PM, CHEBARO Abdallah <
Abdallah.CHEBARO@murex.com> wrote:

>  Hello,
>
>
>
> Using Hive, we know that we should specify the file path to read data from
> a specific location. If the data is distributed on many computers, how can
> we read it?
>
>
>
> Thanks
>
> *******************************
>
> This e-mail contains information for the intended recipient only. It may
> contain proprietary material or confidential information. If you are not
> the intended recipient you are not authorised to distribute, copy or use
> this e-mail or any attachment to it. Murex cannot guarantee that it is
> virus free and accepts no responsibility for any loss or damage arising
> from its use. If you have received this e-mail in error please notify
> immediately the sender and delete the original email received, any
> attachments and all copies from your system.
>



-- 
Nitin Pawar