You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Hive QA (JIRA)" <ji...@apache.org> on 2018/06/14 15:27:00 UTC

[jira] [Commented] (HIVE-19103) Nested structure Projection Push Down in Hive with ORC

    [ https://issues.apache.org/jira/browse/HIVE-19103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16512611#comment-16512611 ] 

Hive QA commented on HIVE-19103:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12927617/HIVE-19103.1.patch

{color:red}ERROR:{color} -1 due to build exiting with an error

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/11782/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/11782/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-11782/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Tests exited with: NonZeroExitCodeException
Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N'
2018-06-14 15:26:24.167
+ [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]]
+ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m '
+ ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m '
+ export 'MAVEN_OPTS=-Xmx1g '
+ MAVEN_OPTS='-Xmx1g '
+ cd /data/hiveptest/working/
+ tee /data/hiveptest/logs/PreCommit-HIVE-Build-11782/source-prep.txt
+ [[ false == \t\r\u\e ]]
+ mkdir -p maven ivy
+ [[ git = \s\v\n ]]
+ [[ git = \g\i\t ]]
+ [[ -z master ]]
+ [[ -d apache-github-source-source ]]
+ [[ ! -d apache-github-source-source/.git ]]
+ [[ ! -d apache-github-source-source ]]
+ date '+%Y-%m-%d %T.%3N'
2018-06-14 15:26:24.171
+ cd apache-github-source-source
+ git fetch origin
+ git reset --hard HEAD
HEAD is now at f20311b HIVE-19739: Bootstrap REPL LOAD to use checkpoints to validate and skip the loaded data/metadata (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Anishek Agarwal)
+ git clean -f -d
+ git checkout master
Already on 'master'
Your branch is up-to-date with 'origin/master'.
+ git reset --hard origin/master
HEAD is now at f20311b HIVE-19739: Bootstrap REPL LOAD to use checkpoints to validate and skip the loaded data/metadata (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Anishek Agarwal)
+ git merge --ff-only origin/master
Already up-to-date.
+ date '+%Y-%m-%d %T.%3N'
2018-06-14 15:26:25.179
+ rm -rf ../yetus_PreCommit-HIVE-Build-11782
+ mkdir ../yetus_PreCommit-HIVE-Build-11782
+ git gc
+ cp -R . ../yetus_PreCommit-HIVE-Build-11782
+ mkdir /data/hiveptest/logs/PreCommit-HIVE-Build-11782/yetus
+ patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh
+ patchFilePath=/data/hiveptest/working/scratch/build.patch
+ [[ -f /data/hiveptest/working/scratch/build.patch ]]
+ chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh
+ /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch
fatal: corrupt patch at line 795
fatal: corrupt patch at line 795
fatal: corrupt patch at line 795
The patch does not appear to apply with p0, p1, or p2
+ result=1
+ '[' 1 -ne 0 ']'
+ rm -rf yetus_PreCommit-HIVE-Build-11782
+ exit 1
'
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12927617 - PreCommit-HIVE-Build

> Nested structure Projection Push Down in Hive with ORC
> ------------------------------------------------------
>
>                 Key: HIVE-19103
>                 URL: https://issues.apache.org/jira/browse/HIVE-19103
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive, ORC
>            Reporter: Ashish Sharma
>            Assignee: Ashish Sharma
>            Priority: Critical
>              Labels: pull-request-available
>         Attachments: HIVE-19103.1.patch
>
>
> Reading required columns only in nested structure schema
> Example - 
> *Current state* - 
> Schema  -  struct<a:int, b:bigint,c:struct<d:int,e:struct<f:int>,g:string>>
> Query - select c.e.f from t where c.e.f > 10;
> Current state - read entire c struct from the file and then filter because "hive.io.file.readcolumn.ids" is referred due to which all the children column are select to read from the file.
> Conf -
>      _hive.io.file.readcolumn.ids  = "2"
>      hive.io.file.readNestedColumn.paths = "c.e.f"_
> Result -       
> boolean[ ] include  = [true,false,false,true,true,true,true,true]
> *Expected state* -
> Schema  -  struct<a:int, b:bigint,c:struct<d:int,e:struct<f:int>,g:string>>
> Query - select c.e.f from t where c.e.f > 10;
> Expected state - instead of reading entire c struct from the file just read only the f column by referring the  " hive.io.file.readNestedColumn.paths".
> Conf -
>      _hive.io.file.readcolumn.ids  = "2"
>      hive.io.file.readNestedColumn.paths = "c.e.f"_
> Result -       
> boolean[ ] include  = [true,false,false,true,false,true,true,false]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)