You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@phoenix.apache.org by "Abhishek Singh Chouhan (JIRA)" <ji...@apache.org> on 2018/03/27 13:07:00 UTC

[jira] [Created] (PHOENIX-4674) Incorrect stats if data size is less than guidepost width

Abhishek Singh Chouhan created PHOENIX-4674:
-----------------------------------------------

             Summary: Incorrect stats if data size is less than guidepost width
                 Key: PHOENIX-4674
                 URL: https://issues.apache.org/jira/browse/PHOENIX-4674
             Project: Phoenix
          Issue Type: Bug
            Reporter: Abhishek Singh Chouhan


For a small table, lets say with a single region < guidepost width, the stats after running update statistics can be way off. This is because we get an empty guidepost for the region and in BaseResultIterators we end up estimating the #rows as guidepostwidth/estimated row size of the table. For a table having <100 rows and guidepost width size of 100 mb, if the estimated row size is 100 bytes we end up estimating a million rows.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)