You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Elliott Clark (JIRA)" <ji...@apache.org> on 2013/06/11 00:02:20 UTC

[jira] [Commented] (HBASE-8723) HBase Intgration tests are failing because of new defaults.

    [ https://issues.apache.org/jira/browse/HBASE-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13679964#comment-13679964 ] 

Elliott Clark commented on HBASE-8723:
--------------------------------------

My hdfs-site has lots of settings that should make fail over faster but from kill to region open is still long enough that we fail it tests:
{code}
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>

  <property>
    <name>dfs.datanode.handler.count</name>
    <!-- default 10 -->
    <value>32</value>
    <description>The number of server threads for the
    datanode.</description>
  </property>

  <property>
    <name>dfs.namenode.handler.count</name>
    <!-- default 10 -->
    <value>32</value>
    <description>The number of server threads for the
    namenode.</description>
  </property>

  <property>
    <name>dfs.block.size</name>
    <value>134217728</value>
    <description>The default block size for new files.</description>
  </property>

  <property>
    <name>dfs.datanode.max.xcievers</name>
    <value>4098</value>
  </property>

  <property>
    <name>dfs.namenode.replication.interval</name>
    <value>15</value>
  </property>

  <property>
    <name>dfs.balance.bandwidthPerSec</name>
    <value>10485760</value>
  </property>

  <property>
    <name>fs.checkpoint.dir</name>
    <value>${hadoop.data.dir1}/dfs/namesecondary</value>
  </property>

  <property>
    <name>dfs.name.dir</name>
    <value>${hadoop.data.dir0}/dfs/name</value>
  </property>

  <property>
    <name>dfs.data.dir</name>
    <value>${hadoop.data.dir0}/dfs/data,${hadoop.data.dir1}/dfs/data,${hadoop.data.dir2}/dfs/data,${hadoop.data.dir3}/dfs/data,${hadoop.data.dir4}/dfs/data,${hadoop.data.dir5}/dfs/data,${hadoop.data.dir6}/dfs/data</value>
  </property>

  <property>
    <name>dfs.datanode.socket.write.timeout</name>
    <value>10000</value>
  </property>

  <property>
    <name>ipc.client.connect.timeout</name>
    <value>1000</value>
  </property>

  <property>
    <name>ipc.client.connect.max.retries.on.timeouts</name>
    <value>2</value>
  </property>

  <property>
    <name>dfs.socket.timeout</name>
    <value>5000</value>
  </property>

  <property>
    <name>dfs.socket.write.timeout</name>
    <value>5000</value>
  </property>

  <property>
    <name>dfs.domain.socket.path</name>
    <value>/var/lib/hadoop/dn_socket._PORT</value>
  </property>

  <property>
    <name>dfs.block.local-path-access.user</name>
    <value>hbase</value>
  </property>

  <property>
     <name>dfs.client.read.shortcircuit.skip.checksum</name>
     <value>true</value>
   </property>

  <property>
    <name>dfs.client.file-block-storage-locations.timeout</name>
    <value>3000</value>
  </property>

</configuration>
{code}


                
> HBase Intgration tests are failing because of new defaults.
> -----------------------------------------------------------
>
>                 Key: HBASE-8723
>                 URL: https://issues.apache.org/jira/browse/HBASE-8723
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Elliott Clark
>            Priority: Blocker
>
> Currently any IT tests that have chaos monkey fail because we are not recovering regions before the number of RPC reties is exhausted.
> We should set that default higher.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira