You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "RACHIT CHAUHAN (JIRA)" <ji...@apache.org> on 2018/11/03 21:35:00 UTC

[jira] [Commented] (HIVE-14269) Performance optimizations for data on S3

    [ https://issues.apache.org/jira/browse/HIVE-14269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16674223#comment-16674223 ] 

RACHIT CHAUHAN commented on HIVE-14269:
---------------------------------------

Hi [~spena]: 

Since so many inter-related issues have been created around this thing, it's becoming difficult for new members looking at it.

A summarized version of what this Jira issue has been able to achieve till date will help a lot.

Can you kindly summarize about how we can optimize S3 writes for HIVE tables ?

 

 

 

> Performance optimizations for data on S3
> ----------------------------------------
>
>                 Key: HIVE-14269
>                 URL: https://issues.apache.org/jira/browse/HIVE-14269
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 2.1.0
>            Reporter: Sergio Peña
>            Assignee: Sergio Peña
>            Priority: Major
>
> Working with tables that resides on Amazon S3 (or any other object store) have several performance impact when reading or writing data, and also consistency issues.
> This JIRA is an umbrella task to monitor all the performance improvements that can be done in Hive to work better with S3 data.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)