You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-user@hadoop.apache.org by Felipe Gutierrez <fe...@gmail.com> on 2014/01/25 01:37:20 UTC
HIVE versus SQL DB
Hi,
I am in a project that has three databases with flat files. Our plan is to
normalize these DB in one. We will need to follow the Data warehouse
concept (ETL - Extraction, Transform, Load).
We are thinking to use Hadoop at the Transform step, because we need to
relate datas from the three databases. Do you think this is a good option?
Is there any tutorial/article about it?
We are also thinking to use HIVE to Extract the files, insert it on Hadoop
and use HIVE to query these datas. At this step we are going to eliminate
blank spaces, duplicate datas, transform a name register to an ID.
What are yours experience about this?
Thanks a lot for any contribution!
Felipe
--
*---- Felipe Oliveira Gutierrez-- Felipe.o.Gutierrez@gmail.com
<Fe...@gmail.com>--
https://sites.google.com/site/lipe82/Home/diaadia
<https://sites.google.com/site/lipe82/Home/diaadia>*
Re: HIVE versus SQL DB
Posted by "Martin, Nick" <Ni...@pssd.com>.
Hi Felipe,
The Hive user list will be the best place to post this question.
Thx
Nick
Sent from my iPhone
On Jan 24, 2014, at 7:37 PM, "Felipe Gutierrez" <fe...@gmail.com>> wrote:
Hi,
I am in a project that has three databases with flat files. Our plan is to normalize these DB in one. We will need to follow the Data warehouse concept (ETL - Extraction, Transform, Load).
We are thinking to use Hadoop at the Transform step, because we need to relate datas from the three databases. Do you think this is a good option? Is there any tutorial/article about it?
We are also thinking to use HIVE to Extract the files, insert it on Hadoop and use HIVE to query these datas. At this step we are going to eliminate blank spaces, duplicate datas, transform a name register to an ID.
What are yours experience about this?
Thanks a lot for any contribution!
Felipe
--
--
-- Felipe Oliveira Gutierrez
-- Felipe.o.Gutierrez@gmail.com<ma...@gmail.com>
-- https://sites.google.com/site/lipe82/Home/diaadia
Re: HIVE versus SQL DB
Posted by "Martin, Nick" <Ni...@pssd.com>.
Hi Felipe,
The Hive user list will be the best place to post this question.
Thx
Nick
Sent from my iPhone
On Jan 24, 2014, at 7:37 PM, "Felipe Gutierrez" <fe...@gmail.com>> wrote:
Hi,
I am in a project that has three databases with flat files. Our plan is to normalize these DB in one. We will need to follow the Data warehouse concept (ETL - Extraction, Transform, Load).
We are thinking to use Hadoop at the Transform step, because we need to relate datas from the three databases. Do you think this is a good option? Is there any tutorial/article about it?
We are also thinking to use HIVE to Extract the files, insert it on Hadoop and use HIVE to query these datas. At this step we are going to eliminate blank spaces, duplicate datas, transform a name register to an ID.
What are yours experience about this?
Thanks a lot for any contribution!
Felipe
--
--
-- Felipe Oliveira Gutierrez
-- Felipe.o.Gutierrez@gmail.com<ma...@gmail.com>
-- https://sites.google.com/site/lipe82/Home/diaadia
Re: HIVE versus SQL DB
Posted by "Martin, Nick" <Ni...@pssd.com>.
Hi Felipe,
The Hive user list will be the best place to post this question.
Thx
Nick
Sent from my iPhone
On Jan 24, 2014, at 7:37 PM, "Felipe Gutierrez" <fe...@gmail.com>> wrote:
Hi,
I am in a project that has three databases with flat files. Our plan is to normalize these DB in one. We will need to follow the Data warehouse concept (ETL - Extraction, Transform, Load).
We are thinking to use Hadoop at the Transform step, because we need to relate datas from the three databases. Do you think this is a good option? Is there any tutorial/article about it?
We are also thinking to use HIVE to Extract the files, insert it on Hadoop and use HIVE to query these datas. At this step we are going to eliminate blank spaces, duplicate datas, transform a name register to an ID.
What are yours experience about this?
Thanks a lot for any contribution!
Felipe
--
--
-- Felipe Oliveira Gutierrez
-- Felipe.o.Gutierrez@gmail.com<ma...@gmail.com>
-- https://sites.google.com/site/lipe82/Home/diaadia
Re: HIVE versus SQL DB
Posted by "Martin, Nick" <Ni...@pssd.com>.
Hi Felipe,
The Hive user list will be the best place to post this question.
Thx
Nick
Sent from my iPhone
On Jan 24, 2014, at 7:37 PM, "Felipe Gutierrez" <fe...@gmail.com>> wrote:
Hi,
I am in a project that has three databases with flat files. Our plan is to normalize these DB in one. We will need to follow the Data warehouse concept (ETL - Extraction, Transform, Load).
We are thinking to use Hadoop at the Transform step, because we need to relate datas from the three databases. Do you think this is a good option? Is there any tutorial/article about it?
We are also thinking to use HIVE to Extract the files, insert it on Hadoop and use HIVE to query these datas. At this step we are going to eliminate blank spaces, duplicate datas, transform a name register to an ID.
What are yours experience about this?
Thanks a lot for any contribution!
Felipe
--
--
-- Felipe Oliveira Gutierrez
-- Felipe.o.Gutierrez@gmail.com<ma...@gmail.com>
-- https://sites.google.com/site/lipe82/Home/diaadia