Posted to dev@hive.apache.org by "jiraposter@reviews.apache.org (JIRA)" <ji...@apache.org> on 2011/07/05 21:03:18 UTC

[jira] [Commented] (HIVE-2128) Automatic Indexing with multiple tables

    [ https://issues.apache.org/jira/browse/HIVE-2128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060075#comment-13060075 ] 

jiraposter@reviews.apache.org commented on HIVE-2128:
-----------------------------------------------------


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1010/
-----------------------------------------------------------

Review request for hive and John Sichi.


Summary
-------

During optimized query generation, collect the indexed tables and their associated path URIs, and store them in the Configuration object. When the job is passed to ExecDriver, this data is extracted and used in HiveIndexedInputFormat to decide, for each input path, whether to read through the index file or delegate to the parent class (HiveInputFormat). I am not sure whether this approach is robust.
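
For readers without the diff open, here is a rough sketch of the idea described above. It is not the code from the patch. A plain Map stands in for the Hadoop Configuration object so the example runs on its own, and the property key, class name, and method names are all hypothetical. It also assumes path URIs contain no commas and carry no trailing slash.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

class IndexedPathSketch {
    // Hypothetical property key; the real key used by the patch is not shown in this message.
    static final String INDEXED_PATHS_KEY = "sketch.hive.index.indexed.paths";

    // Optimizer side: remember the path URIs of the tables that have a usable index.
    static void recordIndexedPaths(Map<String, String> conf, Set<String> pathUris) {
        conf.put(INDEXED_PATHS_KEY, String.join(",", pathUris));
    }

    // Input-format side: read the recorded path URIs back out of the configuration.
    static Set<String> readIndexedPaths(Map<String, String> conf) {
        Set<String> result = new LinkedHashSet<>();
        for (String p : conf.getOrDefault(INDEXED_PATHS_KEY, "").split(",")) {
            if (!p.isEmpty()) {
                result.add(p);
            }
        }
        return result;
    }

    // True if the input path is an indexed table directory or lies beneath one;
    // false means the caller should fall back to the parent input format.
    static boolean shouldUseIndex(Map<String, String> conf, String inputPath) {
        for (String indexed : readIndexedPaths(conf)) {
            if (inputPath.equals(indexed) || inputPath.startsWith(indexed + "/")) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        recordIndexedPaths(conf, new LinkedHashSet<>(Arrays.asList("hdfs://nn/warehouse/src")));

        // A file under the indexed table would be read through the index.
        System.out.println(shouldUseIndex(conf, "hdfs://nn/warehouse/src/part-0"));
        // A second table in the same job has no index, so the parent format handles it.
        System.out.println(shouldUseIndex(conf, "hdfs://nn/warehouse/other/part-0"));
    }
}
```

The point of keying the decision on the path is that a single job touching several tables can mix indexed and non-indexed inputs. Matching on a directory boundary (the appended "/") keeps a table such as "src" from accidentally matching a sibling such as "srcpart".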


This addresses bug HIVE-2128.
    https://issues.apache.org/jira/browse/HIVE-2128


Diffs
-----

  ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 090ecfc 
  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexQueryContext.java 617723e 
  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexedInputFormat.java f1ee95d 
  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 61bbbf5 
  ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 7c91946 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java dbc489f 
  ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java a03a9a6 

Diff: https://reviews.apache.org/r/1010/diff


Testing
-------

Added a new test case, index_auto_mult_tables.q.


Thanks,

Syed



> Automatic Indexing with multiple tables
> ---------------------------------------
>
>                 Key: HIVE-2128
>                 URL: https://issues.apache.org/jira/browse/HIVE-2128
>             Project: Hive
>          Issue Type: Improvement
>          Components: Indexing
>    Affects Versions: 0.8.0
>            Reporter: Russell Melick
>         Attachments: HIVE-2128.1.patch
>
>
> Make automatic indexing work with jobs which access multiple tables.  We'll probably need to modify the way that the index input format works in order to associate index formats/files with specific tables.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira