Posted to issues@tez.apache.org by "Ashish Kumar (JIRA)" <ji...@apache.org> on 2016/08/07 05:53:20 UTC

[jira] [Created] (TEZ-3403) Empty partition issue with Hive on TEZ

Ashish Kumar created TEZ-3403:
---------------------------------

             Summary: Empty partition issue with Hive on TEZ
                 Key: TEZ-3403
                 URL: https://issues.apache.org/jira/browse/TEZ-3403
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Ashish Kumar


Hi,

I'm experiencing a few failures with TEZ regarding Hive partitions. Even though no partition column is used in the query, it still fails with a partition file path not found error.

I'm trying to run the query below with Hive on TEZ and am hitting a partition issue. The same query works fine with the MR engine. The table is an external one, partitioned on the year and month columns. I've seen few times 

*Query:*

*select count(crn) as bookings, month(to_date(from_utc_timestamp(pickup_date,'IST'))) as month from bookings_table where year=2016 group by month(to_date(from_utc_timestamp(pickup_date,'IST')));*

*Error:*

Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: java.io.IOException: java.io.IOException: While processing file s3n://dataplatform-prod-store/warehouse/ola_bookings/year=2016/month=1. null 
at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:78) 
at org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:292) 
at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:148) 
... 14 more 
Caused by: java.io.IOException: java.io.IOException: While processing file s3n://<bucket>/warehouse/bookings_table/year=2016/month=1. null 
at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderNextException(HiveIOExceptionHandlerChain.java:121) 
at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderNextException(HiveIOExceptionHandlerUtil.java:77) 
at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:372) 
at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:79) 
at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:33) 
at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:118) 
at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat$TezGroupedSplitsRecordReader.next(TezGroupedSplitsInputFormat.java:137) 
at org.apache.tez.mapreduce.lib.MRReaderMapred.next(MRReaderMapred.java:113) 
at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:68) 
... 16 more 

*Another error for other query:*

DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 killedVertices:4 
FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1, vertexId=vertex_1470240409111_2339_1_06, diagnostics=[Vertex vertex_1470240409111_2339_1_06 [Map 1] killed/failed due to:ROOT_INPUT_INIT_FAILURE, Vertex Input: app_sessions initializer failed, vertex=vertex_1470240409111_2339_1_06 [Map 1], java.io.FileNotFoundException: No such file or directory: s3n://dataplatform-prod-store/warehouse/dp_appSessions/year=2015/month=02/day=14/hour=03 
at org.apache.hadoop.fs.s3a.S3AFileSystem.getFileStatus(S3AFileSystem.java:1078) 
at org.apache.hadoop.fs.s3a.S3AFileSystem.listStatus(S3AFileSystem.java:783) 
at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1500) 
at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1540) 
at org.apache.hadoop.fs.FileSystem$4.<init>(FileSystem.java:1704) 
at org.apache.hadoop.fs.FileSystem.listLocatedStatus(FileSystem.java:1703) 
at org.apache.hadoop.mapred.InputPathProcessor.perPathComputation(InputPathProcessor.java:235) 
at org.apache.hadoop.mapred.InputPathProcessor.access$000(InputPathProcessor.java:28) 
at org.apache.hadoop.mapred.InputPathProcessor$2.run(InputPathProcessor.java:338) 
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
at java.util.concurrent.FutureTask.run(FutureTask.java:266) 
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) 
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) 
at java.lang.Thread.run(Thread.java:745) 
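
The FileNotFoundException in the second trace may indicate a partition whose registered location has no directory in the store. Below is a minimal sketch of a check for that case. It assumes the registered partition locations and the directory listing have already been collected as plain strings. The helper names partition_spec and missing_partitions, and the "existing" directory in the usage lines, are hypothetical and are not part of Hive or TEZ.

```python
import re

# Matches one Hive-style partition path segment, e.g. "year=2015" or "hour=03".
PARTITION_SEGMENT = re.compile(r"^([^=/]+)=([^/]*)$")


def partition_spec(path):
    """Return the ordered (column, value) pairs encoded in a partition path."""
    spec = []
    for segment in path.rstrip("/").split("/"):
        match = PARTITION_SEGMENT.match(segment)
        if match:
            spec.append((match.group(1), match.group(2)))
    return spec


def missing_partitions(registered_paths, existing_paths):
    """Return registered partition paths that have no matching directory."""
    existing = {p.rstrip("/") for p in existing_paths}
    return [p for p in registered_paths if p.rstrip("/") not in existing]


if __name__ == "__main__":
    # Path taken from the second trace above.
    failing = ("s3n://dataplatform-prod-store/warehouse/dp_appSessions/"
               "year=2015/month=02/day=14/hour=03")
    # Hypothetical listing: assume only the hour=02 directory exists.
    present = ["s3n://dataplatform-prod-store/warehouse/dp_appSessions/"
               "year=2015/month=02/day=14/hour=02/"]
    print(partition_spec(failing))
    print(missing_partitions([failing], present))
```

If the failing path shows up in the missing list, the partition is registered but has no backing directory, which would match the ROOT_INPUT_INIT_FAILURE above. This does not explain why the MR engine tolerates the same layout.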



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)