You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hadoop.apache.org by Jameson Li <ho...@gmail.com> on 2012/10/22 06:07:21 UTC

hadoop 0.20 hdfs archives

Hi,

Our hadoop version is hadoop-0.20-append.
When using hadoop archives, I can see that the file "_index" and
"_masterindex" is set to "repl=10" and the har part file is set to
“repl=3”(see below the fsck show).
So I have a doubt that as the har part file only has repl 3, when the block
is missing, so the har part will be broken?
If I want add the har file's high availability, after the archive mapreduce
I have to manually set the full file's repl to a high number as 10?

The fsck info:
hadoop fsck /hadoop/archive -files -blocks -racks
/hadoop/archive/t.har <dir>
/hadoop/archive/t.har/_index 34433 bytes, 1 block(s):  OK
0. blk_8341000632547263306_70582655 len=34433 repl=10
[/default/G06-202/x.y.z.126:50010, /default/G06-202/x.y.z.127:50010,
/default/B04-203/x.y.z.99:50010, /default/D08-203/x.y.z.100:50010,
/default/F04-203/x.y.z.144:50010, /default/G11-202/x.y.z.86:50010,
/default/G07-202/x.y.z.129:50010, /default/B04-203/x.y.z.91:50010,
/default/B05-203/x.y.z.61:50010, /default/B05-203/x.y.z.63:50010]

/hadoop/archive/t.har/_masterindex 25 bytes, 1 block(s):  OK
0. blk_5449096664018130780_70582655 len=25 repl=10
[/default/G06-202/x.y.z.125:50010, /default/G06-202/x.y.z.119:50010,
/default/B04-203/x.y.z.99:50010, /default/A04-203/x.y.z.56:50010,
/default/B06-203/x.y.z.69:50010, /default/G9-202/x.y.z.74:50010,
/default/G10-202/x.y.z.78:50010, /default/G10-202/x.y.z.77:50010,
/default/B04-203/x.y.z.91:50010, /default/B05-203/x.y.z.62:50010]

/hadoop/archive/t.har/part-0 133722259 bytes, 1 block(s):  OK
0. blk_-9118198399046694176_70582469 len=133722259 repl=3
[/default/B04-203/x.y.z.93:50010, /default/B04-203/x.y.z.91:50010,
/default/D08-203/x.y.z.108:50010]

Status: HEALTHY
 Total size: 133756717 B
 Total dirs: 1
 Total files: 3
 Total blocks (validated): 3 (avg. block size 44585572 B)
 Minimally replicated blocks: 3 (100.0 %)
 Over-replicated blocks: 0 (0.0 %)
 Under-replicated blocks: 0 (0.0 %)
 Mis-replicated blocks: 0 (0.0 %)
 Default replication factor: 3
 Average block replication: 7.6666665
 Corrupt blocks: 0
 Missing replicas: 0 (0.0 %)
 Number of data-nodes: xx
 Number of racks: yy


专注于Mysql,MSSQL,Oracle,Hadoop