Posted to issues@hive.apache.org by "Hive QA (Jira)" <ji...@apache.org> on 2020/02/21 21:41:00 UTC

[jira] [Commented] (HIVE-22832) Parallelise direct insert directory cleaning process

    [ https://issues.apache.org/jira/browse/HIVE-22832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17042185#comment-17042185 ] 

Hive QA commented on HIVE-22832:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12994113/HIVE-22832.2.patch

{color:red}ERROR:{color} -1 due to build exiting with an error

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/20774/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/20774/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-20774/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Tests exited with: NonZeroExitCodeException
Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N'
2020-02-21 21:39:34.069
+ [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]]
+ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m '
+ ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m '
+ export 'MAVEN_OPTS=-Xmx1g '
+ MAVEN_OPTS='-Xmx1g '
+ cd /data/hiveptest/working/
+ tee /data/hiveptest/logs/PreCommit-HIVE-Build-20774/source-prep.txt
+ [[ false == \t\r\u\e ]]
+ mkdir -p maven ivy
+ [[ git = \s\v\n ]]
+ [[ git = \g\i\t ]]
+ [[ -z master ]]
+ [[ -d apache-github-source-source ]]
+ [[ ! -d apache-github-source-source/.git ]]
+ [[ ! -d apache-github-source-source ]]
+ date '+%Y-%m-%d %T.%3N'
2020-02-21 21:39:34.072
+ cd apache-github-source-source
+ git fetch origin
From https://github.com/apache/hive
   c3f3523..6c3ee53  master     -> origin/master
+ git reset --hard HEAD
HEAD is now at c3f3523 HIVE-22744 : TezTask for the vertex with more than one outedge should have proportional sort memory (Ramesh Kumar via Ashutosh Chauhan)
+ git clean -f -d
Removing standalone-metastore/metastore-server/src/gen/
+ git checkout master
Already on 'master'
Your branch is behind 'origin/master' by 1 commit, and can be fast-forwarded.
  (use "git pull" to update your local branch)
+ git reset --hard origin/master
HEAD is now at 6c3ee53 HIVE-21216: Write Parquet INT64 timestamp (Karen Coppage via Marta Kuczora)
+ git merge --ff-only origin/master
Already up-to-date.
+ date '+%Y-%m-%d %T.%3N'
2020-02-21 21:39:35.481
+ rm -rf ../yetus_PreCommit-HIVE-Build-20774
+ mkdir ../yetus_PreCommit-HIVE-Build-20774
+ git gc
+ cp -R . ../yetus_PreCommit-HIVE-Build-20774
+ mkdir /data/hiveptest/logs/PreCommit-HIVE-Build-20774/yetus
+ patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh
+ patchFilePath=/data/hiveptest/working/scratch/build.patch
+ [[ -f /data/hiveptest/working/scratch/build.patch ]]
+ chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh
+ /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch
Trying to apply the patch with -p0
error: patch failed: ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java:32
error: repository lacks the necessary blob to fall back on 3-way merge.
error: ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java: patch does not apply
Trying to apply the patch with -p1
error: src/java/org/apache/hadoop/hive/ql/exec/Utilities.java: does not exist in index
Trying to apply the patch with -p2
error: java/org/apache/hadoop/hive/ql/exec/Utilities.java: does not exist in index
The patch does not appear to apply with p0, p1, or p2
+ result=1
+ '[' 1 -ne 0 ']'
+ rm -rf yetus_PreCommit-HIVE-Build-20774
+ exit 1
'
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12994113 - PreCommit-HIVE-Build

> Parallelise direct insert directory cleaning process
> ----------------------------------------------------
>
>                 Key: HIVE-22832
>                 URL: https://issues.apache.org/jira/browse/HIVE-22832
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Marton Bod
>            Assignee: Marton Bod
>            Priority: Major
>         Attachments: HIVE-22832.1.patch, HIVE-22832.2.patch
>
>
> Inside Utilities::handleDirectInsertTableFinalPath, the cleanDirectInsertDirectories method is called sequentially for each element of the directInsertDirectories list, which can be large depending on how many partitions were written. This sequential execution could be improved by parallelising the cleanup process.
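
The idea in the description can be illustrated with a minimal sketch. This is not the code from the attached patches, and it uses no Hive APIs: the class ParallelCleanupSketch, the method cleanAll, and its parameters are hypothetical names chosen for illustration, and the per-directory cleanup is passed in as a plain callback. It only shows the general pattern of submitting one cleanup task per directory to a fixed-size thread pool and waiting for all of them to finish.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Consumer;

// Hypothetical helper, not part of Hive: runs one cleanup task per directory in parallel.
final class ParallelCleanupSketch {

    private ParallelCleanupSketch() {
    }

    /**
     * Applies cleanOne to every element of directories using a pool of the given size,
     * and returns only after every task has completed. The first task failure is
     * rethrown as a RuntimeException wrapping the original cause.
     */
    static <T> void cleanAll(List<T> directories, Consumer<T> cleanOne, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<?>> futures = new ArrayList<>();
            for (T dir : directories) {
                // One task per directory; the pool bounds how many run concurrently.
                futures.add(pool.submit(() -> cleanOne.accept(dir)));
            }
            for (Future<?> future : futures) {
                // Blocks until the task is done and surfaces any exception it threw.
                future.get();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while cleaning directories", e);
        } catch (ExecutionException e) {
            throw new RuntimeException("Directory cleanup failed", e.getCause());
        } finally {
            // Stops the worker threads; also cancels remaining tasks after a failure.
            pool.shutdownNow();
        }
    }
}
```

A caller would pass the list of directories, a callback that cleans a single directory, and a pool size. A bounded pool is used here on the assumption that the cleanup is dominated by file system calls, so an unbounded number of concurrent requests would not be desirable.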



--
This message was sent by Atlassian Jira
(v8.3.4#803005)