Posted to codereview@trafodion.apache.org by DaveBirdsall <gi...@git.apache.org> on 2016/01/07 22:22:47 UTC

[GitHub] incubator-trafodion pull request: [TRAFODION-1740] Add CQDs to UPDATE STATS for large tables

GitHub user DaveBirdsall opened a pull request:

    https://github.com/apache/incubator-trafodion/pull/253

    [TRAFODION-1740] Add CQDs to UPDATE STATS for large tables

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/DaveBirdsall/incubator-trafodion Trafodion1740

Alternatively, you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-trafodion/pull/253.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #253
    
----
commit 99a143f29910c7d3c8e02fb1bf101214568bf80a
Author: Dave Birdsall <db...@apache.org>
Date:   2016-01-07T21:18:51Z

    [TRAFODION-1740] Add CQDs to UPDATE STATS for large tables

----



[GitHub] incubator-trafodion pull request: [TRAFODION-1740] Add CQDs to UPDATE STATS for large tables

Posted by DaveBirdsall <gi...@git.apache.org>.
GitHub user DaveBirdsall commented on a diff in the pull request:

    https://github.com/apache/incubator-trafodion/pull/253#discussion_r49136585
  
    --- Diff: core/sql/ustat/hs_globals.cpp ---
    @@ -3762,6 +3765,68 @@ Lng32 HSSample::make(NABoolean rowCountIsEstimate, // input
         // For Hive tables the sample table used is a Trafodion table
         if (hs_globals->isHbaseTable || hs_globals->isHiveTable)
           {
    +        // The optimal degree of parallelism for the LOAD or UPSERT is
    +        // the number of partitions of the original table. Force that.
    +        // Note that when the default for AGGRESSIVE_ESP_ALLOCATION_PER_CORE
    +        // is permanently changed to 'ON', we may be able to remove this CQD.
    +        if (hs_globals->objDef->getNumPartitions() > 1)
    +          {
    +            char temp[40];  // way more space than needed, but it's safe
    +            sprintf(temp,"'%d'",hs_globals->objDef->getNumPartitions());
    +            NAString EspsCQD = "CONTROL QUERY DEFAULT PARALLEL_NUM_ESPS ";
    +            EspsCQD += temp;
    +            HSFuncExecQuery(EspsCQD);
    +            EspCQDUsed = TRUE;  // remember to reset later
    +          }
    +
    +        // If the table is very large, we risk HBase time-outs because the
    +        // sample scan doesn't return rows fast enough. In this case, we
    +        // want to reduce the HBase row cache size to a smaller number to
    +        // force more frequent returns. Experience shows that a value of
    +        // '10' worked well with a 17.7 billion row table with 128 regions
    +        // on six nodes (one million row sample). We'll assume a workable
    +        // HBase cache size value scales linearly with the sampling ratio.
    +        // That is, we'll assume the model:
    +        //
    +        //   workable value = (sample row count / actual row count) * c,
    +        //   where c is chosen so that we get 10 when the sample row count
    +        //   is 1,000,000 and the actual row count is 17.7 billion.
    +        //
    +        //   Solving for c, we get c = 10 * (17.7 billion/1 million).
    +        //
    +        // Note that the Generator does a similar calculation in
    +        // Generator::setHBaseNumCacheRows. The calculation here is more
    +        // conservative because we care more about getting UPDATE STATISTICS
    +        // done without a timeout, trading off possible speed improvements
    +        // by using a smaller cache size.
    +        //
    +        // Note that when we move to HBase 1.1, with its heartbeat protocol,
    +        // this time-out problem goes away and we can remove these CQDs.
    +        if (hs_globals->isHbaseTable)
    +          {
    +            double sampleRatio = (double)(sampleRowCnt) / hs_globals->actualRowCount;
    +            double calibrationFactor = 10 * (17700000000/1000000);
    +            Int64 workableCacheSize = (Int64)(sampleRatio * calibrationFactor);
    +            if (workableCacheSize < 1)
    +              workableCacheSize = 1;  // can't go below 1 unfortunately
    +
    +            Int32 max = getDefaultAsLong(HBASE_NUM_CACHE_ROWS_MAX);
    +            if ((workableCacheSize < 10000) && // don't bother if 10000 works
    +                (max == 10000))  // don't do it if user has already set this CQD
    +              {
    +                char temp1[40];  // way more space than needed, but it's safe
    +                Lng32 wcs = (Lng32)workableCacheSize;  
    +                sprintf(temp1,"'%d'",wcs);
    +                NAString minCQD = "CONTROL QUERY HBASE_NUM_CACHE_ROWS_MIN ";
    --- End diff --
    
    Bug: Need to add the DEFAULT keyword after CONTROL QUERY
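    
    In other words, the statement should be built with the same pattern as the
    PARALLEL_NUM_ESPS CQD earlier in this hunk, which does include DEFAULT. A
    sketch of the corrected lines (not the final patch):
    
        NAString minCQD = "CONTROL QUERY DEFAULT HBASE_NUM_CACHE_ROWS_MIN ";
        minCQD += temp1;          // appends the quoted value, e.g. '10'
        HSFuncExecQuery(minCQD);
    
    For scale, plugging the numbers from the comment into the model:
    calibrationFactor = 10 * (17,700,000,000 / 1,000,000) = 177,000, so a
    one-million-row sample of a 17.7-billion-row table gives
    workableCacheSize = (1,000,000 / 17,700,000,000) * 177,000 = 10, the
    value that worked in practice. By the same token, the earlier block
    would issue CONTROL QUERY DEFAULT PARALLEL_NUM_ESPS '128' for a table
    with 128 partitions.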



[GitHub] incubator-trafodion pull request: [TRAFODION-1740] Add CQDs to UPDATE STATS for large tables

Posted by DaveBirdsall <gi...@git.apache.org>.
GitHub user DaveBirdsall commented on a diff in the pull request:

    https://github.com/apache/incubator-trafodion/pull/253#discussion_r49136610
  
    --- Diff: core/sql/ustat/hs_globals.cpp ---
    @@ -3762,6 +3765,68 @@ Lng32 HSSample::make(NABoolean rowCountIsEstimate, // input
         // For Hive tables the sample table used is a Trafodion table
         if (hs_globals->isHbaseTable || hs_globals->isHiveTable)
           {
    +        // The optimal degree of parallelism for the LOAD or UPSERT is
    +        // the number of partitions of the original table. Force that.
    +        // Note that when the default for AGGRESSIVE_ESP_ALLOCATION_PER_CORE
    +        // is permanently changed to 'ON', we may be able to remove this CQD.
    +        if (hs_globals->objDef->getNumPartitions() > 1)
    +          {
    +            char temp[40];  // way more space than needed, but it's safe
    +            sprintf(temp,"'%d'",hs_globals->objDef->getNumPartitions());
    +            NAString EspsCQD = "CONTROL QUERY DEFAULT PARALLEL_NUM_ESPS ";
    +            EspsCQD += temp;
    +            HSFuncExecQuery(EspsCQD);
    +            EspCQDUsed = TRUE;  // remember to reset later
    +          }
    +
    +        // If the table is very large, we risk HBase time-outs because the
    +        // sample scan doesn't return rows fast enough. In this case, we
    +        // want to reduce the HBase row cache size to a smaller number to
    +        // force more frequent returns. Experience shows that a value of
    +        // '10' worked well with a 17.7 billion row table with 128 regions
    +        // on six nodes (one million row sample). We'll assume a workable
    +        // HBase cache size value scales linearly with the sampling ratio.
    +        // That is, we'll assume the model:
    +        //
    +        //   workable value = (sample row count / actual row count) * c,
    +        //   where c is chosen so that we get 10 when the sample row count
    +        //   is 1,000,000 and the actual row count is 17.7 billion.
    +        //
    +        //   Solving for c, we get c = 10 * (17.7 billion/1 million).
    +        //
    +        // Note that the Generator does a similar calculation in
    +        // Generator::setHBaseNumCacheRows. The calculation here is more
    +        // conservative because we care more about getting UPDATE STATISTICS
    +        // done without a timeout, trading off possible speed improvements
    +        // by using a smaller cache size.
    +        //
    +        // Note that when we move to HBase 1.1, with its heartbeat protocol,
    +        // this time-out problem goes away and we can remove these CQDs.
    +        if (hs_globals->isHbaseTable)
    +          {
    +            double sampleRatio = (double)(sampleRowCnt) / hs_globals->actualRowCount;
    +            double calibrationFactor = 10 * (17700000000/1000000);
    +            Int64 workableCacheSize = (Int64)(sampleRatio * calibrationFactor);
    +            if (workableCacheSize < 1)
    +              workableCacheSize = 1;  // can't go below 1 unfortunately
    +
    +            Int32 max = getDefaultAsLong(HBASE_NUM_CACHE_ROWS_MAX);
    +            if ((workableCacheSize < 10000) && // don't bother if 10000 works
    +                (max == 10000))  // don't do it if user has already set this CQD
    +              {
    +                char temp1[40];  // way more space than needed, but it's safe
    +                Lng32 wcs = (Lng32)workableCacheSize;  
    +                sprintf(temp1,"'%d'",wcs);
    +                NAString minCQD = "CONTROL QUERY HBASE_NUM_CACHE_ROWS_MIN ";
    +                minCQD += temp1;
    +                HSFuncExecQuery(minCQD); 
    +                NAString maxCQD = "CONTROL QUERY HBASE_NUM_CACHE_ROWS_MAX ";
    --- End diff --
    
    Same here.
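    
    That is, applying the same fix (a sketch; the diff is cut off at this
    line, so the two lines after the string literal are assumed to follow
    the minCQD pattern above):
    
        NAString maxCQD = "CONTROL QUERY DEFAULT HBASE_NUM_CACHE_ROWS_MAX ";
        maxCQD += temp1;          // presumably the same value as the min,
        HSFuncExecQuery(maxCQD);  // pinning the cache size to workableCacheSize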



[GitHub] incubator-trafodion pull request: [TRAFODION-1740] Add CQDs to UPDATE STATS for large tables

Posted by asfgit <gi...@git.apache.org>.
GitHub user asfgit closed the pull request at:

    https://github.com/apache/incubator-trafodion/pull/253

