Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/11/15 08:51:00 UTC

[jira] [Work logged] (HIVE-21146) Enforce TransactionBatch size=1 for blob stores

     [ https://issues.apache.org/jira/browse/HIVE-21146?focusedWorklogId=344061&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-344061 ]

ASF GitHub Bot logged work on HIVE-21146:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 15/Nov/19 08:50
            Start Date: 15/Nov/19 08:50
    Worklog Time Spent: 10m 
      Work Description: dlavati commented on issue #797: HIVE-21146 Enforce TransactionBatch size=1 for blob stores
URL: https://github.com/apache/hive/pull/797#issuecomment-554269771
 
 
   @t3rmin4t0r, @prasanthj, is this something you could review, please?
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 344061)
    Time Spent: 20m  (was: 10m)

> Enforce TransactionBatch size=1 for blob stores
> -----------------------------------------------
>
>                 Key: HIVE-21146
>                 URL: https://issues.apache.org/jira/browse/HIVE-21146
>             Project: Hive
>          Issue Type: Bug
>          Components: Streaming, Transactions
>    Affects Versions: 3.0.0
>            Reporter: Eugene Koifman
>            Assignee: David Lavati
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: HIVE-21146.2.patch, HIVE-21146.2.patch, HIVE-21146.2.patch, HIVE-21146.patch
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> The Streaming Ingest API supports a concept of {{TransactionBatch}}, where N transactions can be opened at once and the data in all of them is written to the same delta_x_y directory, while each transaction in the batch can be committed/aborted independently.  The implementation relies on {{FSDataOutputStream.hflush()}} (called from {{OrcRecordUpdater}}), which is available on HDFS but is often implemented as a no-op in blob-store-backed {{FileSystem}} objects.
> We need to add a check to the {{HiveStreamingConnection()}} constructor that raises an error if {{builder.transactionBatchSize > 1}} and the target table/partitions are backed by something that doesn't support {{hflush()}}.
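
The check described above could look roughly like the following. This is a minimal, self-contained sketch and not the actual patch: the class name BatchSizeCheck, the method validate, and the boolean supportsHflush are hypothetical names introduced for illustration, and how the connection would determine whether the target file system really supports hflush() is deliberately left open.

```java
// Hypothetical sketch of the proposed validation; not Hive code.
final class BatchSizeCheck {

    private BatchSizeCheck() {
    }

    /**
     * Rejects a transaction batch size greater than 1 when the target storage
     * does not support hflush().
     *
     * @param transactionBatchSize the batch size requested on the builder
     * @param supportsHflush       assumption: the caller has already worked out
     *                             whether hflush() is effective on the target
     */
    static void validate(int transactionBatchSize, boolean supportsHflush) {
        if (transactionBatchSize > 1 && !supportsHflush) {
            throw new IllegalArgumentException(
                "transactionBatchSize=" + transactionBatchSize
                    + " requires hflush() support on the target storage; use 1 instead");
        }
    }

    public static void main(String[] args) {
        validate(1, false); // a single transaction per batch is always accepted
        validate(10, true); // larger batches are accepted when hflush() works
        try {
            validate(10, false); // larger batch on storage without hflush()
        } catch (IllegalArgumentException e) {
            System.out.println("Rejected: " + e.getMessage());
        }
    }
}
```

Failing at connection construction, rather than at the first commit, would surface the misconfiguration before any data is written.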



--
This message was sent by Atlassian Jira
(v8.3.4#803005)