Posted to mapreduce-user@hadoop.apache.org by Jean-Marc Spaggiari <je...@spaggiari.org> on 2013/02/11 02:57:39 UTC

Multiple dfs.data.dir vs RAID0

Hi,

I have a quick question regarding RAID0 performance vs. multiple
dfs.data.dir entries.

Let's say I have 2 x 2TB drives.

I can configure them as 2 separate drives mounted on 2 folders and
assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
with RAID0 and assign them as a single folder to dfs.data.dir.
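
For illustration, the two-directory option would look something like
this in hdfs-site.xml (the paths here are just examples):

  <property>
    <name>dfs.data.dir</name>
    <value>/hadoop1/dfs/data,/hadoop2/dfs/data</value>
  </property>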

With RAID0, the reads and writes are going to be spread over the 2
disks, which significantly increases the speed. But if I put 2
entries in dfs.data.dir, Hadoop is going to spread over those 2
directories too, so in the end the results should be the same, no?

Any experience/advice/results to share?

Thanks,

JM

Re: Multiple dfs.data.dir vs RAID0

Posted by Chris Embree <ce...@gmail.com>.
Interesting question.  You'd probably need to benchmark to prove it out.

I'm not sure of the exact details of how HDFS stripes data, but it
should compare pretty well to hardware RAID.

Conceptually, HDFS should be able to outperform a RAID solution, since
HDFS "knows" more about the data being written.  One of the benefits of
HDFS is being able to buy cheaper hardware and still get good
performance.

We bought cheap DL165s for our datanodes: 4x 2TB drives with no RAID.

On Sun, Feb 10, 2013 at 8:57 PM, Jean-Marc Spaggiari <jean-marc@spaggiari.org> wrote:

> Hi,
>
> I have a quick question regarding RAID0 performance vs. multiple
> dfs.data.dir entries.
>
> Let's say I have 2 x 2TB drives.
>
> I can configure them as 2 separate drives mounted on 2 folders and
> assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
> with RAID0 and assign them as a single folder to dfs.data.dir.
>
> With RAID0, the reads and writes are going to be spread over the 2
> disks, which significantly increases the speed. But if I put 2
> entries in dfs.data.dir, Hadoop is going to spread over those 2
> directories too, so in the end the results should be the same, no?
>
> Any experience/advice/results to share?
>
> Thanks,
>
> JM
>

Re: Multiple dfs.data.dir vs RAID0

Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
@Michael:
I have done some tests between RAID0, 1, JBOD and LVM on another server.

Results are here:
http://www.spaggiari.org/index.php/hbase/hard-drives-performances
LVM and JBOD were close; that's why I talked about LVM, since it seems
to be pretty close to JBOD performance-wise and can be done on any
hardware, even if the motherboard does not offer any RAID/JBOD option.

@Chris:
I will have to test and see. Like, what if I add a drive now to an
existing DataNode? Is it going to spread its existing data over the 2
drives? Or are they going to grow at the same speed?

I will add one drive to one server tomorrow and see the results...
Then I will run some performance tests and see...
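
For the raw disk side, I will probably just start with something quick
like dd on each mount point (rough numbers only, not a real benchmark):

  dd if=/dev/zero of=/hadoop1/ddtest bs=1M count=4096 oflag=direct  # sequential write
  dd if=/hadoop1/ddtest of=/dev/null bs=1M iflag=direct             # sequential read
  rm /hadoop1/ddtest

and the same again for the second mount point.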

2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
> Are you able to create multiple RAID0 volumes? Perhaps you can expose
> each disk as its own RAID0 volume...
>
> Not sure why or where LVM comes into the picture here ... LVM is on
> the software layer and (hopefully) the RAID/JBOD stuff is at the
> hardware layer (and in the case of HDFS, LVM will only add unneeded
> overhead).
>
> -Michael
>
> On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari
> <je...@spaggiari.org> wrote:
>
>> The issue is that my MB does not do JBOD :( Only RAID is possible,
>> and I've been fighting for the last 48h and am still not able to make
>> it work... That's why I'm thinking about using dfs.data.dir instead.
>>
>> I have 1 drive per node so far and need to move to 2 to reduce WIO.
>>
>> What would be better with JBOD compared to dfs.data.dir? I have done
>> some tests of JBOD vs LVM and did not find any pros for JBOD so far.
>>
>> JM
>>
>> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
>>> One thought comes to mind: disk failure. In the event a disk goes bad,
>>> then with RAID0, you just lost your entire array. With JBOD, you lost
>>> one disk.
>>>
>>> -Michael
>>>
>>> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
>>> <je...@spaggiari.org> wrote:
>>>
>>>> Hi,
>>>>
>>>> I have a quick question regarding RAID0 performance vs. multiple
>>>> dfs.data.dir entries.
>>>>
>>>> Let's say I have 2 x 2TB drives.
>>>>
>>>> I can configure them as 2 separate drives mounted on 2 folders and
>>>> assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
>>>> with RAID0 and assign them as a single folder to dfs.data.dir.
>>>>
>>>> With RAID0, the reads and writes are going to be spread over the 2
>>>> disks, which significantly increases the speed. But if I put 2
>>>> entries in dfs.data.dir, Hadoop is going to spread over those 2
>>>> directories too, so in the end the results should be the same, no?
>>>>
>>>> Any experience/advice/results to share?
>>>>
>>>> Thanks,
>>>>
>>>> JM
>>>
>

Re: Multiple dfs.data.dir vs RAID0

Posted by Michael Katzenellenbogen <mi...@cloudera.com>.
On Mon, Feb 11, 2013 at 10:54 AM, Jean-Marc Spaggiari <jean-marc@spaggiari.org> wrote:

> Thanks all for your feedback.
>
> I have updated the hdfs config to add another dfs.data.dir entry and
> restarted the node. Hadoop is starting to use the new entry, but is not
> spreading the existing data over the 2 directories.
>
> Let's say you have a 2TB disk on /hadoop1, almost full. If you add
> another 2TB disk on /hadoop2 and add it to dfs.data.dir, Hadoop will
> start to write into /hadoop1 and /hadoop2, but /hadoop1 will stay
> almost full. It will not balance the already existing data over the 2
> directories.
>
> I have deleted all the content of /hadoop1 and /hadoop2 and restarted
> the node and now the data is spread over the 2. Just need to wait for
> the replication to complete.
>
> So what I will do instead is add 2 x 2TB drives, mount them as
> RAID0, then move the existing data onto this drive and remove the
> previous one. That way Hadoop will still see one directory under
> /hadoop1, but it will be 4TB instead of 2TB...
>
> Is there anywhere I can read about Hadoop vs. the different kinds of
> physical data storage configurations? (Book, web, etc.)
>

"Hadoop Operations" by E. Sammer:
http://shop.oreilly.com/product/0636920025085.do


>
> JM
>
> 2013/2/11, Ted Dunning <td...@maprtech.com>:
> > Typical best practice is to have a separate file system per spindle.  If
> > you have a RAID only controller (many are), then you just create one RAID
> > per spindle.  The effect is the same.
> >
> > MapR is unusual in being able to stripe writes over multiple drives
> > organized into a storage pool, but you will not normally be able to
> > achieve that same level of performance with ordinary Hadoop by using
> > LVM over JBOD or controller-level RAID.  The problem is that the Java
> > layer doesn't understand that the storage is striped and the controller
> > doesn't understand what Hadoop is doing.  MapR schedules all of the
> > writes to individual spindles via a very fast state machine embedded in
> > the file system.
> >
> > The comment about striping increasing the impact of a single disk drive
> > failure is exactly correct, and it makes modeling the failure modes of
> > the system considerably more complex.  The net result of the modeling
> > that I and others have done is that moderate to large RAID groups in
> > storage pools are just fine for moderate-sized clusters (< 2000 nodes
> > or so).  For large clusters of up to 10,000 nodes, you should probably
> > limit RAID groups to 4 drives or less.
> >
> > On Sun, Feb 10, 2013 at 7:39 PM, Marcos Ortiz <ml...@uci.cu> wrote:
> >
> >>  We have seen in several of our Hadoop clusters that LVM degrades
> >> performance of our M/R jobs, and I remembered a message where
> >> Ted Dunning was explaining something about this, and since
> >> that time, we don't use LVM for Hadoop data directories.
> >>
> >> About RAID volumes, the best performance that we have achieved
> >> is using RAID 10 for our Hadoop data directories.
> >>
> >>
> >>
> >> On 02/10/2013 09:24 PM, Michael Katzenellenbogen wrote:
> >>
> >> Are you able to create multiple RAID0 volumes? Perhaps you can expose
> >> each disk as its own RAID0 volume...
> >>
> >> Not sure why or where LVM comes into the picture here ... LVM is on
> >> the software layer and (hopefully) the RAID/JBOD stuff is at the
> >> hardware layer (and in the case of HDFS, LVM will only add unneeded
> >> overhead).
> >>
> >> -Michael
> >>
> >> On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari
> >> <je...@spaggiari.org> wrote:
> >>
> >>
> >>  The issue is that my MB does not do JBOD :( Only RAID is possible,
> >> and I've been fighting for the last 48h and am still not able to make
> >> it work... That's why I'm thinking about using dfs.data.dir instead.
> >>
> >> I have 1 drive per node so far and need to move to 2 to reduce WIO.
> >>
> >> What would be better with JBOD compared to dfs.data.dir? I have done
> >> some tests of JBOD vs LVM and did not find any pros for JBOD so far.
> >>
> >> JM
> >>
> >> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
> >>
> >>  One thought comes to mind: disk failure. In the event a disk goes bad,
> >> then with RAID0, you just lost your entire array. With JBOD, you lost
> >> one disk.
> >>
> >> -Michael
> >>
> >> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
> >> <je...@spaggiari.org> wrote:
> >>
> >>
> >>  Hi,
> >>
> >> I have a quick question regarding RAID0 performance vs. multiple
> >> dfs.data.dir entries.
> >>
> >> Let's say I have 2 x 2TB drives.
> >>
> >> I can configure them as 2 separate drives mounted on 2 folders and
> >> assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
> >> with RAID0 and assign them as a single folder to dfs.data.dir.
> >>
> >> With RAID0, the reads and writes are going to be spread over the 2
> >> disks, which significantly increases the speed. But if I put 2
> >> entries in dfs.data.dir, Hadoop is going to spread over those 2
> >> directories too, so in the end the results should be the same, no?
> >>
> >> Any experience/advice/results to share?
> >>
> >> Thanks,
> >>
> >> JM
> >>
> >>
> >> --
> >> Marcos Ortiz Valmaseda,
> >> Product Manager && Data Scientist at UCI
> >> Blog: http://marcosluis2186.posterous.com
> >> Twitter: @marcosluis2186 <http://twitter.com/marcosluis2186>
> >>
> >
>

Re: Multiple dfs.data.dir vs RAID0

Posted by Olivier Renault <or...@hortonworks.com>.
You can also rebalance the disks using the steps described in the FAQ:
http://wiki.apache.org/hadoop/FAQ#On_an_individual_data_node.2C_how_do_you_balance_the_blocks_on_the_disk.3F
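
In short, with the datanode stopped, you move some block files from the
full directory to the new one and restart. Roughly something like this
(the subdir name is just an example; keep each blk_* file together with
its .meta file):

  bin/hadoop-daemon.sh stop datanode
  mv /hadoop1/dfs/data/current/subdir10 /hadoop2/dfs/data/current/
  bin/hadoop-daemon.sh start datanode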

Olivier


On 11 February 2013 15:54, Jean-Marc Spaggiari <je...@spaggiari.org> wrote:

> Thanks all for your feedback.
>
> I have updated the hdfs config to add another dfs.data.dir entry and
> restarted the node. Hadoop is starting to use the new entry, but is not
> spreading the existing data over the 2 directories.
>
> Let's say you have a 2TB disk on /hadoop1, almost full. If you add
> another 2TB disk on /hadoop2 and add it to dfs.data.dir, Hadoop will
> start to write into /hadoop1 and /hadoop2, but /hadoop1 will stay
> almost full. It will not balance the already existing data over the 2
> directories.
>
> I have deleted all the content of /hadoop1 and /hadoop2 and restarted
> the node and now the data is spread over the 2. Just need to wait for
> the replication to complete.
>
> So what I will do instead is add 2 x 2TB drives, mount them as
> RAID0, then move the existing data onto this drive and remove the
> previous one. That way Hadoop will still see one directory under
> /hadoop1, but it will be 4TB instead of 2TB...
>
> Is there anywhere I can read about Hadoop vs. the different kinds of
> physical data storage configurations? (Book, web, etc.)
>
> JM
>
> 2013/2/11, Ted Dunning <td...@maprtech.com>:
> > Typical best practice is to have a separate file system per spindle.  If
> > you have a RAID only controller (many are), then you just create one RAID
> > per spindle.  The effect is the same.
> >
> > MapR is unusual in being able to stripe writes over multiple drives
> > organized into a storage pool, but you will not normally be able to
> > achieve that same level of performance with ordinary Hadoop by using
> > LVM over JBOD or controller-level RAID.  The problem is that the Java
> > layer doesn't understand that the storage is striped and the controller
> > doesn't understand what Hadoop is doing.  MapR schedules all of the
> > writes to individual spindles via a very fast state machine embedded in
> > the file system.
> >
> > The comment about striping increasing the impact of a single disk drive
> > failure is exactly correct, and it makes modeling the failure modes of
> > the system considerably more complex.  The net result of the modeling
> > that I and others have done is that moderate to large RAID groups in
> > storage pools are just fine for moderate-sized clusters (< 2000 nodes
> > or so).  For large clusters of up to 10,000 nodes, you should probably
> > limit RAID groups to 4 drives or less.
> >
> > On Sun, Feb 10, 2013 at 7:39 PM, Marcos Ortiz <ml...@uci.cu> wrote:
> >
> >>  We have seen in several of our Hadoop clusters that LVM degrades
> >> performance of our M/R jobs, and I remembered a message where
> >> Ted Dunning was explaining something about this, and since
> >> that time, we don't use LVM for Hadoop data directories.
> >>
> >> About RAID volumes, the best performance that we have achieved
> >> is using RAID 10 for our Hadoop data directories.
> >>
> >>
> >>
> >> On 02/10/2013 09:24 PM, Michael Katzenellenbogen wrote:
> >>
> >> Are you able to create multiple RAID0 volumes? Perhaps you can expose
> >> each disk as its own RAID0 volume...
> >>
> >> Not sure why or where LVM comes into the picture here ... LVM is on
> >> the software layer and (hopefully) the RAID/JBOD stuff is at the
> >> hardware layer (and in the case of HDFS, LVM will only add unneeded
> >> overhead).
> >>
> >> -Michael
> >>
> >> On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari
> >> <je...@spaggiari.org> wrote:
> >>
> >>
> >>  The issue is that my MB does not do JBOD :( Only RAID is possible,
> >> and I've been fighting for the last 48h and am still not able to make
> >> it work... That's why I'm thinking about using dfs.data.dir instead.
> >>
> >> I have 1 drive per node so far and need to move to 2 to reduce WIO.
> >>
> >> What would be better with JBOD compared to dfs.data.dir? I have done
> >> some tests of JBOD vs LVM and did not find any pros for JBOD so far.
> >>
> >> JM
> >>
> >> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
> >>
> >>  One thought comes to mind: disk failure. In the event a disk goes bad,
> >> then with RAID0, you just lost your entire array. With JBOD, you lost
> >> one disk.
> >>
> >> -Michael
> >>
> >> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
> >> <je...@spaggiari.org> wrote:
> >>
> >>
> >>  Hi,
> >>
> >> I have a quick question regarding RAID0 performance vs. multiple
> >> dfs.data.dir entries.
> >>
> >> Let's say I have 2 x 2TB drives.
> >>
> >> I can configure them as 2 separate drives mounted on 2 folders and
> >> assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
> >> with RAID0 and assign them as a single folder to dfs.data.dir.
> >>
> >> With RAID0, the reads and writes are going to be spread over the 2
> >> disks, which significantly increases the speed. But if I put 2
> >> entries in dfs.data.dir, Hadoop is going to spread over those 2
> >> directories too, so in the end the results should be the same, no?
> >>
> >> Any experience/advice/results to share?
> >>
> >> Thanks,
> >>
> >> JM
> >>
> >>
> >> --
> >> Marcos Ortiz Valmaseda,
> >> Product Manager && Data Scientist at UCI
> >> Blog: http://marcosluis2186.posterous.com
> >> Twitter: @marcosluis2186 <http://twitter.com/marcosluis2186>
> >>
> >
>



-- 
Olivier Renault
Solution Engineer - Big Data - Hortonworks, Inc.
+44 7500 933 036
orenault@hortonworks.com
www.hortonworks.com

Re: Multiple dfs.data.dir vs RAID0

Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
Thanks all for your feedback.

I have updated the hdfs config to add another dfs.data.dir entry and
restarted the node. Hadoop is starting to use the new entry, but is not
spreading the existing data over the 2 directories.

Let's say you have a 2TB disk on /hadoop1, almost full. If you add
another 2TB disk on /hadoop2 and add it to dfs.data.dir, Hadoop will
start to write into /hadoop1 and /hadoop2, but /hadoop1 will stay
almost full. It will not balance the already existing data over the 2
directories.

I have deleted all the content of /hadoop1 and /hadoop2 and restarted
the node and now the data is spread over the 2. Just need to wait for
the replication to complete.

So what I will do instead is add 2 x 2TB drives, mount them as
RAID0, then move the existing data onto this drive and remove the
previous one. That way Hadoop will still see one directory under
/hadoop1, but it will be 4TB instead of 2TB...
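
Something along these lines, I guess (device names are just examples,
and the datanode would be stopped during the copy):

  mdadm --create /dev/md0 --level=0 --raid-devices=2 /dev/sdb /dev/sdc
  mkfs.ext4 /dev/md0
  mount /dev/md0 /mnt/newraid
  cp -a /hadoop1/. /mnt/newraid/
  # then remount /dev/md0 as /hadoop1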

Is there anywhere I can read about Hadoop vs. the different kinds of
physical data storage configurations? (Book, web, etc.)

JM

2013/2/11, Ted Dunning <td...@maprtech.com>:
> Typical best practice is to have a separate file system per spindle.  If
> you have a RAID only controller (many are), then you just create one RAID
> per spindle.  The effect is the same.
>
> MapR is unusual in being able to stripe writes over multiple drives organized into a
> storage pool, but you will not normally be able to achieve that same level
> of performance with ordinary Hadoop by using LVM over JBOD or controller
> level RAID.  The problem is that the Java layer doesn't understand that the
> storage is striped and the controller doesn't understand what Hadoop is
> doing.  MapR schedules all of the writes to individual spindles via a very
> fast state machine embedded in the file system.
>
> The comment about striping increasing the impact of a single disk drive failure is
> exactly correct, and it makes modeling the failure modes of the system
> considerably more complex.  The net result of the modeling that I and
> others have done is that moderate to large RAID groups in storage pools are
> just fine for moderate-sized clusters (< 2000 nodes or so).  For large
> clusters of up to 10,000 nodes, you should probably limit RAID groups to 4
> drives or less.
>
> On Sun, Feb 10, 2013 at 7:39 PM, Marcos Ortiz <ml...@uci.cu> wrote:
>
>>  We have seen in several of our Hadoop clusters that LVM degrades
>> performance of our M/R jobs, and I remembered a message where
>> Ted Dunning was explaining something about this, and since
>> that time, we don't use LVM for Hadoop data directories.
>>
>> About RAID volumes, the best performance that we have achieved
>> is using RAID 10 for our Hadoop data directories.
>>
>>
>>
>> On 02/10/2013 09:24 PM, Michael Katzenellenbogen wrote:
>>
>> Are you able to create multiple RAID0 volumes? Perhaps you can expose
>> each disk as its own RAID0 volume...
>>
>> Not sure why or where LVM comes into the picture here ... LVM is on
>> the software layer and (hopefully) the RAID/JBOD stuff is at the
>> hardware layer (and in the case of HDFS, LVM will only add unneeded
>> overhead).
>>
>> -Michael
>>
>> On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari
>> <je...@spaggiari.org> wrote:
>>
>>
>>  The issue is that my MB does not do JBOD :( Only RAID is possible,
>> and I've been fighting for the last 48h and am still not able to make
>> it work... That's why I'm thinking about using dfs.data.dir instead.
>>
>> I have 1 drive per node so far and need to move to 2 to reduce WIO.
>>
>> What would be better with JBOD compared to dfs.data.dir? I have done
>> some tests of JBOD vs LVM and did not find any pros for JBOD so far.
>>
>> JM
>>
>> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
>>
>>  One thought comes to mind: disk failure. In the event a disk goes bad,
>> then with RAID0, you just lost your entire array. With JBOD, you lost
>> one disk.
>>
>> -Michael
>>
>> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
>> <je...@spaggiari.org> wrote:
>>
>>
>>  Hi,
>>
>> I have a quick question regarding RAID0 performance vs. multiple
>> dfs.data.dir entries.
>>
>> Let's say I have 2 x 2TB drives.
>>
>> I can configure them as 2 separate drives mounted on 2 folders and
>> assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
>> with RAID0 and assign them as a single folder to dfs.data.dir.
>>
>> With RAID0, the reads and writes are going to be spread over the 2
>> disks, which significantly increases the speed. But if I put 2
>> entries in dfs.data.dir, Hadoop is going to spread over those 2
>> directories too, so in the end the results should be the same, no?
>>
>> Any experience/advice/results to share?
>>
>> Thanks,
>>
>> JM
>>
>>
>> --
>> Marcos Ortiz Valmaseda,
>> Product Manager && Data Scientist at UCI
>> Blog: http://marcosluis2186.posterous.com
>> Twitter: @marcosluis2186 <http://twitter.com/marcosluis2186>
>>
>

Re: Mutiple dfs.data.dir vs RAID0

Posted by Ted Dunning <td...@maprtech.com>.
Typical best practice is to have a separate file system per spindle.  If
you have a RAID-only controller (many are), then you just create one RAID
per spindle.  The effect is the same.

MapR is unusual in being able to stripe writes over multiple drives organized into a
storage pool, but you will not normally be able to achieve that same level
of performance with ordinary Hadoop by using LVM over JBOD or controller
level RAID.  The problem is that the Java layer doesn't understand that the
storage is striped and the controller doesn't understand what Hadoop is
doing.  MapR schedules all of the writes to individual spindles via a very
fast state machine embedded in the file system.

The comment about striping increasing the impact of a single disk drive
failure is exactly correct, and it makes modeling the failure modes of the
system considerably more complex.  The net result of the modeling that I and
others have done is that moderate to large RAID groups in storage pools for
moderate-sized clusters (< 2000 nodes or so) are just fine.  For large
clusters of up to 10,000 nodes, you should probably limit RAID groups to 4
drives or less.

On Sun, Feb 10, 2013 at 7:39 PM, Marcos Ortiz <ml...@uci.cu> wrote:

>  We have seen in several of our Hadoop clusters that LVM degrades
> performance of our M/R jobs, and I remembered a message where
> Ted Dunning was explaining something about this, and since
> that time, we don't use LVM for Hadoop data directories.
>
> About RAID volumes, the best performance that we have achieved
> is using RAID 10 for our Hadoop data directories.
>
>
>
> On 02/10/2013 09:24 PM, Michael Katzenellenbogen wrote:
>
> Are you able to create multiple RAID0 volumes? Perhaps you can expose
> each disk as its own RAID0 volume...
>
> Not sure why or where LVM comes into the picture here ... LVM is on
> the software layer and (hopefully) the RAID/JBOD stuff is at the
> hardware layer (and in the case of HDFS, LVM will only add unneeded
> overhead).
>
> -Michael
>
> On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari <je...@spaggiari.org> wrote:
>
>
> The issue is that my MB does not support JBOD :( Only RAID is
> possible, and I've been fighting for the last 48h and am still not able
> to make it work... That's why I'm thinking about using dfs.data.dir instead.
>
> I have 1 drive per node so far and need to move to 2 to reduce WIO.
>
> What would be better with JBOD compared to dfs.data.dir? I have done some
> tests of JBOD vs LVM and did not find any pros for JBOD so far.
>
> JM
>
> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
>
>  One thought comes to mind: disk failure. In the event a disk goes bad,
> then with RAID0, you just lost your entire array. With JBOD, you lost
> one disk.
>
> -Michael
>
> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari <je...@spaggiari.org> wrote:
>
>
>  Hi,
>
> I have a quick question regarding RAID0 performances vs multiple
> dfs.data.dir entries.
>
> Let's say I have 2 x 2TB drives.
>
> I can configure them as 2 separate drives mounted on 2 folders and
> assigned to hadoop using dfs.data.dir. Or I can mount the 2 drives
> with RAID0 and assign them as a single folder to dfs.data.dir.
>
> With RAID0, the reads and writes are going to be spread over the 2
> disks. This significantly increases the speed. But if I put 2
> entries in dfs.data.dir, hadoop is going to spread over those 2
> directories too, so at the end the results should be the same, no?
>
> Any experience/advice/results to share?
>
> Thanks,
>
> JM
>
>
> --
> Marcos Ortiz Valmaseda,
> Product Manager && Data Scientist at UCI
> Blog: http://marcosluis2186.posterous.com
> Twitter: @marcosluis2186 <http://twitter.com/marcosluis2186>
>
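
For concreteness, "a separate file system per spindle" with stock Hadoop
comes down to one mount per disk plus one dfs.data.dir entry per mount. A
minimal sketch, assuming two data disks /dev/sdb1 and /dev/sdc1 and the
/hadoop1 and /hadoop2 mount points used elsewhere in this thread:

  # One filesystem per spindle; -m 0 drops root-reserved blocks on a
  # data-only partition.
  mkfs.ext4 -m 0 /dev/sdb1
  mkfs.ext4 -m 0 /dev/sdc1
  mkdir -p /hadoop1 /hadoop2
  # noatime avoids a metadata write on every block read.
  mount -o noatime /dev/sdb1 /hadoop1
  mount -o noatime /dev/sdc1 /hadoop2

and then, in hdfs-site.xml, one comma-separated entry per mount:

  <property>
    <name>dfs.data.dir</name>
    <value>/hadoop1/dfs/data,/hadoop2/dfs/data</value>
  </property>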

Re: Mutiple dfs.data.dir vs RAID0

Posted by Marcos Ortiz <ml...@uci.cu>.
We have seen in several of our Hadoop clusters that LVM degrades
performance of our M/R jobs, and I remembered a message where
Ted Dunning was explaining something about this, and since
that time, we don't use LVM for Hadoop data directories.

About RAID volumes, the best performance that we have achieved
is using RAID 10 for our Hadoop data directories.


On 02/10/2013 09:24 PM, Michael Katzenellenbogen wrote:
> Are you able to create multiple RAID0 volumes? Perhaps you can expose
> each disk as its own RAID0 volume...
>
> Not sure why or where LVM comes into the picture here ... LVM is on
> the software layer and (hopefully) the RAID/JBOD stuff is at the
> hardware layer (and in the case of HDFS, LVM will only add unneeded
> overhead).
>
> -Michael
>
> On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari
> <je...@spaggiari.org> wrote:
>
>> The issue is that my MB does not support JBOD :( Only RAID is
>> possible, and I've been fighting for the last 48h and am still not able
>> to make it work... That's why I'm thinking about using dfs.data.dir instead.
>>
>> I have 1 drive per node so far and need to move to 2 to reduce WIO.
>>
>> What would be better with JBOD compared to dfs.data.dir? I have done some
>> tests of JBOD vs LVM and did not find any pros for JBOD so far.
>>
>> JM
>>
>> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
>>> One thought comes to mind: disk failure. In the event a disk goes bad,
>>> then with RAID0, you just lost your entire array. With JBOD, you lost
>>> one disk.
>>>
>>> -Michael
>>>
>>> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
>>> <je...@spaggiari.org> wrote:
>>>
>>>> Hi,
>>>>
>>>> I have a quick question regarding RAID0 performances vs multiple
>>>> dfs.data.dir entries.
>>>>
>>>> Let's say I have 2 x 2TB drives.
>>>>
>>>> I can configure them as 2 separate drives mounted on 2 folders and
>>>> assigned to hadoop using dfs.data.dir. Or I can mount the 2 drives
>>>> with RAID0 and assign them as a single folder to dfs.data.dir.
>>>>
>>>> With RAID0, the reads and writes are going to be spread over the 2
>>>> disks. This significantly increases the speed. But if I put 2
>>>> entries in dfs.data.dir, hadoop is going to spread over those 2
>>>> directories too, so at the end the results should be the same, no?
>>>>
>>>> Any experience/advice/results to share?
>>>>
>>>> Thanks,
>>>>
>>>> JM

-- 
Marcos Ortiz Valmaseda,
Product Manager && Data Scientist at UCI
Blog: http://marcosluis2186.posterous.com
Twitter: @marcosluis2186 <http://twitter.com/marcosluis2186>
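
For anyone wanting to reproduce the LVM side of this comparison, a striped
logical volume is typically built roughly as follows (the volume group and
logical volume names are made up for the example; the stripe size is in KB):

  pvcreate /dev/sdb1 /dev/sdc1
  vgcreate vg_hadoop /dev/sdb1 /dev/sdc1
  # Stripe across both physical volumes, similar in spirit to RAID0.
  lvcreate --stripes 2 --stripesize 256 --extents 100%FREE \
           --name lv_hdfs vg_hadoop
  mkfs.ext4 /dev/vg_hadoop/lv_hdfs

Whether that beats plain per-disk mounts is exactly what the benchmarks
discussed elsewhere in this thread try to answer.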

Re: Mutiple dfs.data.dir vs RAID0

Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
@Michael:
I have done some tests between RAID0, 1, JBOD and LVM on another server.

Results are here:
http://www.spaggiari.org/index.php/hbase/hard-drives-performances
LVM and JBOD were close; that's why I talked about LVM, since it seems
to be pretty close to JBOD performance-wise and can be done on any
hardware even if the MB does not offer any RAID/JBOD option.

@Chris:
I will have to test and see. Like, what if I add a drive now to an
existing DataNode? Is it going to spread its existing data over the 2
drives? Or are they going to grow at the same speed?

I will add one drive to one server tomorrow and see the results...
Then I will run some performance tests and see...

2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
> Are you able to create multiple RAID0 volumes? Perhaps you can expose
> each disk as its own RAID0 volume...
>
> Not sure why or where LVM comes into the picture here ... LVM is on
> the software layer and (hopefully) the RAID/JBOD stuff is at the
> hardware layer (and in the case of HDFS, LVM will only add unneeded
> overhead).
>
> -Michael
>
> On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari
> <je...@spaggiari.org> wrote:
>
>> The issue is that my MB does not support JBOD :( Only RAID is
>> possible, and I've been fighting for the last 48h and am still not able
>> to make it work... That's why I'm thinking about using dfs.data.dir instead.
>>
>> I have 1 drive per node so far and need to move to 2 to reduce WIO.
>>
>> What would be better with JBOD compared to dfs.data.dir? I have done some
>> tests of JBOD vs LVM and did not find any pros for JBOD so far.
>>
>> JM
>>
>> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
>>> One thought comes to mind: disk failure. In the event a disk goes bad,
>>> then with RAID0, you just lost your entire array. With JBOD, you lost
>>> one disk.
>>>
>>> -Michael
>>>
>>> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
>>> <je...@spaggiari.org> wrote:
>>>
>>>> Hi,
>>>>
>>>> I have a quick question regarding RAID0 performances vs multiple
>>>> dfs.data.dir entries.
>>>>
>>>> Let's say I have 2 x 2TB drives.
>>>>
>>>> I can configure them as 2 separate drives mounted on 2 folders and
>>>> assigned to hadoop using dfs.data.dir. Or I can mount the 2 drives
>>>> with RAID0 and assign them as a single folder to dfs.data.dir.
>>>>
>>>> With RAID0, the reads and writes are going to be spread over the 2
>>>> disks. This significantly increases the speed. But if I put 2
>>>> entries in dfs.data.dir, hadoop is going to spread over those 2
>>>> directories too, so at the end the results should be the same, no?
>>>>
>>>> Any experience/advice/results to share?
>>>>
>>>> Thanks,
>>>>
>>>> JM
>>>
>
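
For reference, the kind of quick per-mount sequential test this usually
means looks something like the following (paths and device names are
illustrative; oflag=direct bypasses the page cache so the disk itself is
measured rather than RAM):

  # Sequential write to one data directory.
  dd if=/dev/zero of=/hadoop1/ddtest bs=1M count=4096 oflag=direct
  # Sequential read back.
  dd if=/hadoop1/ddtest of=/dev/null bs=1M iflag=direct
  # Raw device read timings for comparison.
  hdparm -tT /dev/sdb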

Re: Mutiple dfs.data.dir vs RAID0

Posted by Michael Katzenellenbogen <mi...@cloudera.com>.
Are you able to create multiple RAID0 volumes? Perhaps you can expose
each disk as its own RAID0 volume...

Not sure why or where LVM comes into the picture here ... LVM is on
the software layer and (hopefully) the RAID/JBOD stuff is at the
hardware layer (and in the case of HDFS, LVM will only add unneeded
overhead).

-Michael

On Feb 10, 2013, at 9:19 PM, Jean-Marc Spaggiari
<je...@spaggiari.org> wrote:

> The issue is that my MB does not support JBOD :( Only RAID is
> possible, and I've been fighting for the last 48h and am still not able
> to make it work... That's why I'm thinking about using dfs.data.dir instead.
>
> I have 1 drive per node so far and need to move to 2 to reduce WIO.
>
> What would be better with JBOD compared to dfs.data.dir? I have done some
> tests of JBOD vs LVM and did not find any pros for JBOD so far.
>
> JM
>
> 2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
>> One thought comes to mind: disk failure. In the event a disk goes bad,
>> then with RAID0, you just lost your entire array. With JBOD, you lost
>> one disk.
>>
>> -Michael
>>
>> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
>> <je...@spaggiari.org> wrote:
>>
>>> Hi,
>>>
>>> I have a quick question regarding RAID0 performances vs multiple
>>> dfs.data.dir entries.
>>>
>>> Let's say I have 2 x 2TB drives.
>>>
>>> I can configure them as 2 separate drives mounted on 2 folders and
>>> assigned to hadoop using dfs.data.dir. Or I can mount the 2 drives
>>> with RAID0 and assign them as a single folder to dfs.data.dir.
>>>
>>> With RAID0, the reads and writes are going to be spread over the 2
>>> disks. This significantly increases the speed. But if I put 2
>>> entries in dfs.data.dir, hadoop is going to spread over those 2
>>> directories too, so at the end the results should be the same, no?
>>>
>>> Any experience/advice/results to share?
>>>
>>> Thanks,
>>>
>>> JM
>>
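
Michael presumably means doing this in the controller's BIOS (one
single-disk RAID0 virtual drive per physical disk). Where the controller
cannot, the same shape can usually be forced in software with mdadm; a
sketch, with device names assumed:

  # One single-disk RAID0 array per physical drive; mdadm needs --force
  # to accept an array with only one member.
  mdadm --create /dev/md0 --level=0 --force --raid-devices=1 /dev/sdb
  mdadm --create /dev/md1 --level=0 --force --raid-devices=1 /dev/sdc
  mkfs.ext4 /dev/md0
  mkfs.ext4 /dev/md1

Each resulting device is then mounted and listed in dfs.data.dir exactly
as a plain disk would be.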

Re: Mutiple dfs.data.dir vs RAID0

Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
The issue is that my motherboard (MB) does not do JBOD :( Only RAID
is possible, and I've been fighting with it for the last 48h and am
still not able to make it work... That's why I'm thinking about using
dfs.data.dir instead.

I have 1 drive per node so far and need to move to 2 to reduce wait
I/O (WIO).

What would be better with JBOD compared to dfs.data.dir? I have done
some tests of JBOD vs LVM and did not find any pros for JBOD so far.

JM

2013/2/10, Michael Katzenellenbogen <mi...@cloudera.com>:
> One thought comes to mind: disk failure. If a disk goes bad, with
> RAID0 you lose your entire array. With JBOD, you lose one disk.
>
> -Michael
>
> On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
> <je...@spaggiari.org> wrote:
>
>> Hi,
>>
>> I have a quick question regarding RAID0 performance vs multiple
>> dfs.data.dir entries.
>>
>> Let's say I have 2 x 2TB drives.
>>
>> I can configure them as 2 separate drives mounted on 2 folders and
>> assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
>> with RAID0 and assign them as a single folder to dfs.data.dir.
>>
>> With RAID0, the reads and writes are going to be spread over the 2
>> disks. This significantly increases the speed. But if I put 2
>> entries in dfs.data.dir, Hadoop is going to spread the load over
>> those 2 directories too, so in the end the results should be the
>> same, no?
>>
>> Any experience/advice/results to share?
>>
>> Thanks,
>>
>> JM
>

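If the wait I/O comes from MapReduce shuffle traffic as well as from
HDFS, note that mapred.local.dir also accepts a comma-separated list,
so intermediate data can be spread over both drives too. A sketch for
mapred-site.xml, reusing the same hypothetical mount points as above:

    <property>
      <name>mapred.local.dir</name>
      <value>/data/1/mapred/local,/data/2/mapred/local</value>
      <!-- The TaskTracker spreads map spill and shuffle files
           across all listed directories. -->
    </property>
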
Re: Mutiple dfs.data.dir vs RAID0

Posted by Michael Katzenellenbogen <mi...@cloudera.com>.
One thought comes to mind: disk failure. If a disk goes bad, with
RAID0 you lose your entire array. With JBOD, you lose one disk.

-Michael

On Feb 10, 2013, at 8:58 PM, Jean-Marc Spaggiari
<je...@spaggiari.org> wrote:

> Hi,
>
> I have a quick question regarding RAID0 performance vs multiple
> dfs.data.dir entries.
>
> Let's say I have 2 x 2TB drives.
>
> I can configure them as 2 separate drives mounted on 2 folders and
> assigned to Hadoop using dfs.data.dir. Or I can mount the 2 drives
> with RAID0 and assign them as a single folder to dfs.data.dir.
>
> With RAID0, the reads and writes are going to be spread over the 2
> disks. This significantly increases the speed. But if I put 2
> entries in dfs.data.dir, Hadoop is going to spread the load over
> those 2 directories too, so in the end the results should be the
> same, no?
>
> Any experience/advice/results to share?
>
> Thanks,
>
> JM

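Michael's failure argument has a configuration angle as well: by
default a DataNode shuts itself down when any one of its dfs.data.dir
volumes fails. If your Hadoop version supports
dfs.datanode.failed.volumes.tolerated, the node can keep serving from
the surviving disk after a failure, which no RAID0 array can do. A
hedged sketch for hdfs-site.xml:

    <property>
      <name>dfs.datanode.failed.volumes.tolerated</name>
      <value>1</value>
      <!-- Keep the DataNode running after one data directory fails;
           the default of 0 shuts the node down on the first failure. -->
    </property>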