You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2009/02/25 22:41:01 UTC

[jira] Created: (HBASE-1224) Scanner returns values from before startrow

Scanner returns values from before startrow
-------------------------------------------

                 Key: HBASE-1224
                 URL: https://issues.apache.org/jira/browse/HBASE-1224
             Project: Hadoop HBase
          Issue Type: Bug
            Reporter: stack
            Priority: Critical
             Fix For: 0.19.1, 0.20.0


Ben Maurer reports our returning rows from before startrow -- lots of concurrent scanners run out of mapreduce jobs that are updating rows as they go.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HBASE-1224) Scanner returns values from before startrow

Posted by "stack (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HBASE-1224:
-------------------------

    Attachment: 1224-test.patch

> Scanner returns values from before startrow
> -------------------------------------------
>
>                 Key: HBASE-1224
>                 URL: https://issues.apache.org/jira/browse/HBASE-1224
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Priority: Critical
>             Fix For: 0.19.1, 0.20.0
>
>         Attachments: 1224-test.patch, 1224.patch, ben-1224.patch
>
>
> Ben Maurer reports our returning rows from before startrow -- lots of concurrent scanners run out of mapreduce jobs that are updating rows as they go.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (HBASE-1224) Scanner returns values from before startrow

Posted by "stack (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack resolved HBASE-1224.
--------------------------

       Resolution: Fixed
    Fix Version/s:     (was: 0.20.0)
     Hadoop Flags: [Reviewed]

Applied Ben's patch and my test to 0.19 branch.  TRUNK doesn't have this issue with hfile.

> Scanner returns values from before startrow
> -------------------------------------------
>
>                 Key: HBASE-1224
>                 URL: https://issues.apache.org/jira/browse/HBASE-1224
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Priority: Critical
>             Fix For: 0.19.1
>
>         Attachments: 1224-test.patch, 1224.patch, ben-1224.patch
>
>
> Ben Maurer reports our returning rows from before startrow -- lots of concurrent scanners run out of mapreduce jobs that are updating rows as they go.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HBASE-1224) Scanner returns values from before startrow

Posted by "stack (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HBASE-1224:
-------------------------

    Attachment: 1224.patch

Ben digging turned up fact that HalfMapFile sets current position back to midkey when its the top-half of a file and its first invocation of next.

Trying to reproduce in a unit test, I see this happening but higher up in HStoreFileScanner, while the wrong answer comes up out of the HalfMapFile, HSFS keeps iterating the HalfMapFile until it starts returning right answers.  Something more subtle is going on.  Meantime this patch makes halfmapfile do the right thing.  Maybe it fixes Bens issue.

> Scanner returns values from before startrow
> -------------------------------------------
>
>                 Key: HBASE-1224
>                 URL: https://issues.apache.org/jira/browse/HBASE-1224
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Priority: Critical
>             Fix For: 0.19.1, 0.20.0
>
>         Attachments: 1224.patch
>
>
> Ben Maurer reports our returning rows from before startrow -- lots of concurrent scanners run out of mapreduce jobs that are updating rows as they go.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HBASE-1224) Scanner returns values from before startrow

Posted by "Ben Maurer (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ben Maurer updated HBASE-1224:
------------------------------

    Attachment: ben-1224.patch

My version of the patch

> Scanner returns values from before startrow
> -------------------------------------------
>
>                 Key: HBASE-1224
>                 URL: https://issues.apache.org/jira/browse/HBASE-1224
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Priority: Critical
>             Fix For: 0.19.1, 0.20.0
>
>         Attachments: 1224.patch, ben-1224.patch
>
>
> Ben Maurer reports our returning rows from before startrow -- lots of concurrent scanners run out of mapreduce jobs that are updating rows as they go.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Assigned: (HBASE-1224) Scanner returns values from before startrow

Posted by "stack (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack reassigned HBASE-1224:
----------------------------

    Assignee: Ben Maurer

> Scanner returns values from before startrow
> -------------------------------------------
>
>                 Key: HBASE-1224
>                 URL: https://issues.apache.org/jira/browse/HBASE-1224
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Assignee: Ben Maurer
>            Priority: Critical
>             Fix For: 0.19.1
>
>         Attachments: 1224-test.patch, 1224.patch, ben-1224.patch
>
>
> Ben Maurer reports our returning rows from before startrow -- lots of concurrent scanners run out of mapreduce jobs that are updating rows as they go.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.