You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "ellen johansen (JIRA)" <ji...@apache.org> on 2014/12/11 17:03:13 UTC

[jira] [Created] (HADOOP-11391) Enabling HVE/node awareness does not rebalance replicas on data that existed prior to topology changes.

ellen johansen created HADOOP-11391:
---------------------------------------

             Summary: Enabling HVE/node awareness does not rebalance replicas on data that existed prior to topology changes. 
                 Key: HADOOP-11391
                 URL: https://issues.apache.org/jira/browse/HADOOP-11391
             Project: Hadoop Common
          Issue Type: Bug
         Environment: VMWare w/ local storage
            Reporter: ellen johansen


Enabling HVE/node awareness does not rebalance replicas on data that existed prior to topology changes. 

[root@vmw-d10-001 jenkins]# more /opt/cloudera/topology.data 
10.20.xxx.161   /rack1/nodegroup1
10.20.xxx.162   /rack1/nodegroup1
10.20.xxx.163   /rack3/nodegroup1
10.20.xxx.164   /rack3/nodegroup1
172.17.xxx.71   /rack2/nodegroup1
172.17.xxx.72   /rack2/nodegroup1

before HVE:
/user/impalauser/tpcds/store_sales <dir>
/user/impalauser/tpcds/store_sales/store_sales.dat 1180463121 bytes, 9 block(s):  OK
0. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742xxx_1382 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 10.20.xxx.163:20002]
1. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742213_1389 len=134217728 repl=3 [10.20.xxx.164:20002, 172.17.xxx.72:20002, 10.20.xxx.161:20002]
2. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742214_1390 len=134217728 repl=3 [10.20.xxx.164:20002, 172.17.xxx.72:20002, 10.20.xxx.163:20002]
3. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742215_1391 len=134217728 repl=3 [10.20.xxx.164:20002, 172.17.xxx.72:20002, 10.20.xxx.163:20002]
4. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742216_1392 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 172.17.xxx.72:20002]
5. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742217_1393 len=134217728 repl=3 [10.20.xxx.164:20002, 172.17.xxx.72:20002, 10.20.xxx.163:20002]
6. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742220_1396 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.162:20002, 10.20.xxx.163:20002]
7. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742222_1398 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.163:20002, 10.20.xxx.161:20002]
8. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742224_1400 len=106721297 repl=3 [10.20.xxx.164:20002, 10.20.xxx.162:20002, 172.17.xxx.72:20002]
---------

Before enabling HVE:
Status: HEALTHY
 Total size:	1648156454 B (Total open files size: 498 B)
 Total dirs:	138
 Total files:	384
 Total symlinks:		0 (Files currently being written: 6)
 Total blocks (validated):	390 (avg. block size 4226042 B) (Total open file blocks (not validated): 6)
 Minimally replicated blocks:	390 (100.0 %)
 Over-replicated blocks:	0 (0.0 %)
 Under-replicated blocks:	1 (0.25641027 %)
 Mis-replicated blocks:		0 (0.0 %)
 Default replication factor:	3
 Average block replication:	2.8564103
 Corrupt blocks:		0
 Missing replicas:		5 (0.44682753 %)
 Number of data-nodes:		5
 Number of racks:		1
FSCK ended at Wed Dec 10 14:04:35 EST 2014 in 50 milliseconds

The filesystem under path '/' is HEALTHY

------
after HVE (and NN restart):

/user/impalauser/tpcds/store_sales <dir>
/user/impalauser/tpcds/store_sales/store_sales.dat 1180463121 bytes, 9 block(s):  OK
0. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742xxx_1382 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.163:20002, 10.20.xxx.161:20002]
1. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742213_1389 len=134217728 repl=3 [172.17.xxx.72:20002, 10.20.xxx.164:20002, 10.20.xxx.161:20002]
2. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742214_1390 len=134217728 repl=3 [172.17.xxx.72:20002, 10.20.xxx.164:20002, 10.20.xxx.163:20002]
3. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742215_1391 len=134217728 repl=3 [172.17.xxx.72:20002, 10.20.xxx.164:20002, 10.20.xxx.163:20002]
4. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742216_1392 len=134217728 repl=3 [172.17.xxx.72:20002, 10.20.xxx.164:20002, 10.20.xxx.161:20002]
5. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742217_1393 len=134217728 repl=3 [172.17.xxx.72:20002, 10.20.xxx.164:20002, 10.20.xxx.163:20002]
6. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742220_1396 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.163:20002, 10.20.xxx.162:20002]
7. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742222_1398 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.163:20002, 10.20.xxx.161:20002]
8. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073742224_1400 len=106721297 repl=3 [172.17.xxx.72:20002, 10.20.xxx.164:20002, 10.20.xxx.162:20002]

Status: HEALTHY
 Total size:	1659427036 B (Total open files size: 498 B)
 Total dirs:	176
 Total files:	529
 Total symlinks:		0 (Files currently being written: 6)
 Total blocks (validated):	532 (avg. block size 3119223 B) (Total open file blocks (not validated): 6)
 Minimally replicated blocks:	532 (100.0 %)
 Over-replicated blocks:	0 (0.0 %)
 Under-replicated blocks:	1 (0.18796992 %)
 Mis-replicated blocks:		0 (0.0 %)
 Default replication factor:	3
 Average block replication:	2.8383458
 Corrupt blocks:		0
 Missing replicas:		7 (0.46143705 %)
 Number of data-nodes:		5
 Number of racks:		3
FSCK ended at Wed Dec 10 14:29:23 EST 2014 in 115 milliseconds
The filesystem under path '/' is HEALTHY

------------

store sales pushed to hdfs after HVE was configured:

/user/impalauser/tpcds_after_hve <dir>
/user/impalauser/tpcds_after_hve/store_sales.dat 1180463121 bytes, 9 block(s):  OK
0. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743406_2582 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 172.17.xxx.72:20002]
1. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743412_2588 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.162:20002, 172.17.xxx.72:20002]
2. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743415_2591 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 172.17.xxx.72:20002]
3. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743416_2592 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.162:20002, 172.17.xxx.72:20002]
4. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743417_2593 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 172.17.xxx.72:20002]
5. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743418_2594 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 172.17.xxx.72:20002]
6. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743419_2595 len=134217728 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 172.17.xxx.72:20002]
7. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743422_2598 len=134217728 repl=3 [172.17.xxx.72:20002, 10.20.xxx.164:20002, 10.20.xxx.162:20002]
8. BP-1184748135-172.17.xxx.71-1418235396548:blk_1073743423_2599 len=106721297 repl=3 [10.20.xxx.164:20002, 10.20.xxx.161:20002, 172.17.xxx.72:20002]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)