You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/01/24 10:33:00 UTC

[jira] [Work logged] (HIVE-22771) Partition location incorrectly formed in FileOutputCommitterContainer

     [ https://issues.apache.org/jira/browse/HIVE-22771?focusedWorklogId=376793&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-376793 ]

ASF GitHub Bot logged work on HIVE-22771:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 24/Jan/20 10:32
            Start Date: 24/Jan/20 10:32
    Worklog Time Spent: 10m 
      Work Description: Shivamohan07 commented on pull request #889:  Correcting regex to de-scratchify final partition location HIVE-22771 
URL: https://github.com/apache/hive/pull/889
 
 
   Correcting regex to de-scratchify final partition location HIVE-22771
   Update FileOutputCommitterContainer.java
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 376793)
    Remaining Estimate: 0h
            Time Spent: 10m

> Partition location incorrectly formed in FileOutputCommitterContainer
> ---------------------------------------------------------------------
>
>                 Key: HIVE-22771
>                 URL: https://issues.apache.org/jira/browse/HIVE-22771
>             Project: Hive
>          Issue Type: Bug
>          Components: HCatalog
>    Affects Versions: 1.2.1
>            Reporter: Shivam
>            Priority: Critical
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Class _HCatOutputFormat_ in package _org.apache.hive.hcatalog.mapreduce_ uses function _setOutput_ to generate _idHash_ using below statement:
> +In file org/apache/hive/hcatalog/mapreduce/HCatOutputFormat.java+
>  *line 116: idHash = String.valueOf(Math.random());*
> The output of idHash can be similar to values like this : 7.145347157239135E-4
>  
> And,
> in class _FileOutputCommitterContainer_ in package _org.apache.hive.hcatalog.mapreduce;_
> Uses below statement to compute final partition path:
> +*In org/apache/hive/hcatalog/mapreduce/FileOutputCommitterContainer.java*+
> *line 366: String finalLocn = jobLocation.replaceAll(Path.SEPARATOR + SCRATCH_DIR_NAME + "{color:#FF0000}\\d\\.?\\d+"{color},"");*
> *line 367: partPath = new Path(finalLocn);*
>  
> Regex used here is incorrect, since it will only remove integers after the *SCRATCH_DIR_NAME,* and hence will append  'E-4' (for the above example) in the final partition location. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)