You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2017/10/02 23:17:00 UTC
[jira] [Updated] (HIVE-17658) Bucketed/Sorted tables - SMB join
[ https://issues.apache.org/jira/browse/HIVE-17658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Eugene Koifman updated HIVE-17658:
----------------------------------
Description:
How does this handle tables that are bucketed + sorted?
insert into T values(1,2),(5,6); creates something like delta_2_2/bucket_1
insert into T values(3,4),(7,8) creates delta_3_3/bucket_1
the expectation for any reader would be to see some contiguous subset of (1,2),(3,4),(5,6),(7,8)
but this would require a special reader which I don't see
In particular it's not clear how SMB join can work
was:
How does this handle tables that are bucketed + sorted?
insert into T values(1,2),(5,6); creates something like delta_2_2/bucket_1
insert into T values(3,4),(7,8) creates delta_3_3/bucket_1
the expectation for any reader would be to see some contiguous subset of (1,2),(3,4),(5,6),(7,8)
but this would require a special reader which I don't see
In particular it's not clear how SMB join can work
> Bucketed/Sorted tables - SMB join
> ---------------------------------
>
> Key: HIVE-17658
> URL: https://issues.apache.org/jira/browse/HIVE-17658
> Project: Hive
> Issue Type: Sub-task
> Components: Transactions
> Reporter: Eugene Koifman
>
> How does this handle tables that are bucketed + sorted?
> insert into T values(1,2),(5,6); creates something like delta_2_2/bucket_1
> insert into T values(3,4),(7,8) creates delta_3_3/bucket_1
> the expectation for any reader would be to see some contiguous subset of (1,2),(3,4),(5,6),(7,8)
> but this would require a special reader which I don't see
> In particular it's not clear how SMB join can work
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)