You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Laszlo Pinter (Jira)" <ji...@apache.org> on 2019/10/24 09:10:00 UTC
[jira] [Commented] (ORC-562) Don't wrap readerSchema in acidSchema,
if readerSchema is already acid
[ https://issues.apache.org/jira/browse/ORC-562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16958695#comment-16958695 ]
Laszlo Pinter commented on ORC-562:
-----------------------------------
[~omalley] Could you please review/commit this change? Also, I cannot assign this Jira to myself. Thanks
> Don't wrap readerSchema in acidSchema, if readerSchema is already acid
> ----------------------------------------------------------------------
>
> Key: ORC-562
> URL: https://issues.apache.org/jira/browse/ORC-562
> Project: ORC
> Issue Type: Bug
> Components: Java
> Affects Versions: 1.5.6, 1.6.0
> Reporter: Laszlo Pinter
> Priority: Major
> Attachments: ORC-562.01.patch
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> {code:sql}
> create table tbl1 (a int, b string) partitioned by (ds string) stored as orc tblproperties ('transactional'='true');
> insert into tbl1 partition (ds) values (1, 'fred', 'today'), (2, 'wilma', 'yesterday');
> {code}
> As this table is transactional, all the modifications will generate a new delta directory, containing a delta file in orc format. The schema of this file will be
> {code:sql}
> struct<operation:int,originaltransaction:bigint,bucket:int,rowid:bigint,currenttransaction:bigint,row:struct<a:int,b:string>>
> {code}
> If I create a new partitioned table with the very same schema, and change the partition location to one of the delta directories, I would assume that I would be able to run queries against the contents of the delta file.
> Right now this is not possible in orc, because the original readerschema is wrapped in acidschema again, regardless that the readerschema is already acid.
> {code:sql}
> struct<operation:int,originalTransaction:bigint,bucket:int,rowId:bigint,currentTransaction:bigint,row:struct<operation:int,originaltransaction:bigint,bucket:int,rowid:bigint,currenttransaction:bigint,row:struct<a:int,b:string>>>
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)