You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2017/08/10 22:44:00 UTC

[jira] [Created] (HIVE-17295) Make MemoryManagerImpl.ROWS_BETWEEN_CHECKS configurable

Eugene Koifman created HIVE-17295:
-------------------------------------

             Summary: Make MemoryManagerImpl.ROWS_BETWEEN_CHECKS configurable
                 Key: HIVE-17295
                 URL: https://issues.apache.org/jira/browse/HIVE-17295
             Project: Hive
          Issue Type: Bug
            Reporter: Eugene Koifman
            Assignee: Eugene Koifman


currently addedRow() looks like
{noformat}
public void addedRow(int rows) throws IOException {
    rowsAddedSinceCheck += rows;
    if (rowsAddedSinceCheck >= ROWS_BETWEEN_CHECKS) {
      notifyWriters();
    }
  }
{noformat}

it would be convenient for testing to set ROWS_BETWEEN_CHECKS to a low value so that we can generate multiple stripes with very little data.

Currently the only way to do this is to create a new MemoryManager that overrides this method and install it via OrcFile.WriterOptions but this only works when you have control over creating the Writer.

There is no way to do this via some set of config params to make Hive query for example, create multiple stripes with little data.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)