You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Vasu Mariyala (JIRA)" <ji...@apache.org> on 2014/02/15 02:03:19 UTC

[jira] [Updated] (HBASE-10546) Two scanner objects are open for each hbase map task but only one scanner object is closed

     [ https://issues.apache.org/jira/browse/HBASE-10546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vasu Mariyala updated HBASE-10546:
----------------------------------

    Attachment: 0.94-HBASE-10546.patch

> Two scanner objects are open for each hbase map task but only one scanner object is closed
> ------------------------------------------------------------------------------------------
>
>                 Key: HBASE-10546
>                 URL: https://issues.apache.org/jira/browse/HBASE-10546
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Vasu Mariyala
>         Attachments: 0.94-HBASE-10546.patch
>
>
> Map reduce framework calls createRecordReader of the TableInputFormat/MultiTableInputFormat to get the record reader instance. In this method, we are initializing the TableRecordReaderImpl (restart method). This initializes the scanner object. After this, map reduce framework calls initialize on the RecordReader. In our case, this calls restart of the TableRecordReaderImpl again. Here, it doesn't close the first scanner. At the end of the task, only the second scanner object is closed. Because of this, the smallest read point of HRegion is affected.
> We don't need to initialize the RecordReader in the createRecordReader method and we need to close the scanner object in the restart method. (incase if the restart method is called because of exceptions in the nextKeyValue method)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)