You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@asterixdb.apache.org by "Taewoo Kim (JIRA)" <ji...@apache.org> on 2017/02/13 18:26:41 UTC

[jira] [Commented] (ASTERIXDB-1791) HyracksDataException: Premature file in Join operator

    [ https://issues.apache.org/jira/browse/ASTERIXDB-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15864152#comment-15864152 ] 

Taewoo Kim commented on ASTERIXDB-1791:
---------------------------------------

The truncated file (partition) is re-read to do an in-memory hash join. What a weird case. I'm checking the code again. 

> HyracksDataException: Premature file in Join operator
> -----------------------------------------------------
>
>                 Key: ASTERIXDB-1791
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1791
>             Project: Apache AsterixDB
>          Issue Type: Bug
>            Reporter: Yingyi Bu
>            Assignee: Taewoo Kim
>
> Configuration:
> 2 NC
> 1 GB RAM each
> TPC-H dataset with SF=2
> Query (TPC-H Q18):
> {noformat}
> USE tpch;
> WITH tmp AS
> (
>     SELECT l_orderkey, SUM(l_quantity) t_sum_quantity
>     FROM  LineItem
>     GROUP BY l_orderkey
> )
> SELECT c.c_name, c.c_custkey, o.o_orderkey, o.o_orderdate, o.o_totalprice,
>        SUM(l.l_quantity) sum_quantity
> FROM  Customer c
> JOIN  Orders o ON c.c_custkey = o.o_custkey
> JOIN  tmp t ON o.o_orderkey = t.l_orderkey
> JOIN  LineItem l ON t.l_orderkey = l.l_orderkey
> WHERE t.t_sum_quantity > 30
> GROUP BY c.c_name, c.c_custkey, o.o_orderkey, o.o_orderdate, o.o_totalprice
> ORDER BY o.o_totalprice DESC, o.o_orderdate
> LIMIT 100
> ;
> {noformat}
> Error:
> {noformat}
> Premature end of file
>         at org.apache.hyracks.control.cc.job.JobRun.waitForCompletion(JobRun.java:223)
>         at org.apache.hyracks.control.cc.work.WaitForJobCompletionWork$1.run(WaitForJobCompletionWork.java:50)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>         at java.lang.Thread.run(Thread.java:745)
> Caused by: org.apache.hyracks.api.exceptions.HyracksDataException: Premature end of file
>         at org.apache.hyracks.api.exceptions.HyracksDataException.create(HyracksDataException.java:41)
>         at org.apache.hyracks.control.common.utils.ExceptionUtils.setNodeIds(ExceptionUtils.java:62)
>         at org.apache.hyracks.control.nc.Task.run(Task.java:330)
>         ... 3 more
> Caused by: org.apache.hyracks.api.exceptions.HyracksDataException: Premature end of file
>         at org.apache.hyracks.dataflow.common.io.RunFileReader.nextFrame(RunFileReader.java:66)
>         at org.apache.hyracks.dataflow.std.join.OptimizedHybridHashJoinOperatorDescriptor$ProbeAndJoinActivityNode$1.applyInMemHashJoin(OptimizedHybridHashJoinOperatorDescriptor.java:672)
>         at org.apache.hyracks.dataflow.std.join.OptimizedHybridHashJoinOperatorDescriptor$ProbeAndJoinActivityNode$1.joinPartitionPair(OptimizedHybridHashJoinOperatorDescriptor.java:503)
>         at org.apache.hyracks.dataflow.std.join.OptimizedHybridHashJoinOperatorDescriptor$ProbeAndJoinActivityNode$1.close(OptimizedHybridHashJoinOperatorDescriptor.java:435)
>         at org.apache.hyracks.dataflow.std.join.OptimizedHybridHashJoinOperatorDescriptor$ProbeAndJoinActivityNode$1.close(OptimizedHybridHashJoinOperatorDescriptor.java:439)
>         at org.apache.hyracks.control.nc.Task.pushFrames(Task.java:367)
>         at org.apache.hyracks.control.nc.Task.run(Task.java:308)
>         ... 3 more
> Feb 13, 2017 4:05:33 PM org.apache.asterix.api.http.server.QueryServiceServlet handleRequest
> SEVERE: Job failed on account of:
> Premature end of file
> org.apache.hyracks.api.exceptions.HyracksException: Job failed on account of:
> Premature end of file
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)