You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2017/07/31 17:48:00 UTC
[jira] [Created] (HIVE-17215) Streaming Ingest API writing
unbucketed tables
Eugene Koifman created HIVE-17215:
-------------------------------------
Summary: Streaming Ingest API writing unbucketed tables
Key: HIVE-17215
URL: https://issues.apache.org/jira/browse/HIVE-17215
Project: Hive
Issue Type: Sub-task
Components: Transactions
Reporter: Eugene Koifman
Assignee: Eugene Koifman
Currently the API expects the target table to be bucketed.
It creates 1 writer per bucket per connection/partition.
The simplest is to allow the API to create a single writer for unbucketed tables.
If this doesn't provide enough write throughput, the client can create another connection.
Could add a parameter to the API to specify writer parallelism for unbucketed tables. If it's set to 2 for example, the writer will write delta_x_y_0000 and delta_x_y_00001 using statementId. Maybe as a followup.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)