You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "John Sichi (JIRA)" <ji...@apache.org> on 2010/08/11 04:07:16 UTC

[jira] Commented: (HIVE-1495) supply correct information to hooks and lineage for index rebuild

    [ https://issues.apache.org/jira/browse/HIVE-1495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897113#action_12897113 ] 

John Sichi commented on HIVE-1495:
----------------------------------

The lineage looks correct now for REBUILD; maybe we should set a comment in the schema for  to make it clear that they are virtual columns?

What about input/output?  In the example below, compare REBUILD (no input/output, only lineage) with the query which follows (input, output, and lineage).

{noformat}
PREHOOK: query: ALTER INDEX src_index ON src REBUILD
PREHOOK: type: QUERY
POSTHOOK: query: ALTER INDEX src_index ON src REBUILD
POSTHOOK: type: QUERY
POSTHOOK: Lineage: default__src_src_index__._bucketname SIMPLE [(src)src.FieldSchema(name:INPUT__FILE__NAME, type:string, comment:), ]
POSTHOOK: Lineage: default__src_src_index__._offsets EXPRESSION [(src)src.FieldSchema(name:BLOCK__OFFSET__INSIDE__FILE, type:bigint, comment:), ]
POSTHOOK: Lineage: default__src_src_index__.key SIMPLE [(src)src.FieldSchema(name:key, type:string, comment:default), ]
PREHOOK: query: SELECT x.* FROM default__src_src_index__ x ORDER BY key
PREHOOK: type: QUERY
PREHOOK: Input: default@default__src_src_index__
PREHOOK: Output: file:/var/folders/6g/6grtCwPMEf4sqHUPpy6xQG9ByHg/-Tmp-/heyongqiang/hive_2010-08-10_15-32-36_632_4988408731924434502/-mr-10000
POSTHOOK: query: SELECT x.* FROM default__src_src_index__ x ORDER BY key
POSTHOOK: type: QUERY
POSTHOOK: Input: default@default__src_src_index__
POSTHOOK: Output: file:/var/folders/6g/6grtCwPMEf4sqHUPpy6xQG9ByHg/-Tmp-/heyongqiang/hive_2010-08-10_15-32-36_632_4988408731924434502/-mr-10000
POSTHOOK: Lineage: default__src_src_index__._bucketname SIMPLE [(src)src.FieldSchema(name:INPUT__FILE__NAME, type:string, comment:), ]
POSTHOOK: Lineage: default__src_src_index__._offsets EXPRESSION [(src)src.FieldSchema(name:BLOCK__OFFSET__INSIDE__FILE, type:bigint, comment:), ]
POSTHOOK: Lineage: default__src_src_index__.key SIMPLE [(src)src.FieldSchema(name:key, type:string, comment:default), ]
{noformat}


> supply correct information to hooks and lineage for index rebuild
> -----------------------------------------------------------------
>
>                 Key: HIVE-1495
>                 URL: https://issues.apache.org/jira/browse/HIVE-1495
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Indexing
>    Affects Versions: 0.7.0
>            Reporter: John Sichi
>            Assignee: He Yongqiang
>             Fix For: 0.7.0
>
>         Attachments: hive-1495.1.patch
>
>
> This is a followup for HIVE-417.  
> Ashish can probably help on how this should work.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.