You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Vladislav (Jira)" <ji...@apache.org> on 2022/02/11 13:03:00 UTC
[jira] [Updated] (HIVE-25954) LOAD DATA INPATH can't load data from empty Sequence File (.seq)
[ https://issues.apache.org/jira/browse/HIVE-25954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vladislav updated HIVE-25954:
-----------------------------
Description:
Whether using pyhive or impala, I'v got the same error when loading data to external table via
LOAD DATA INPATH 'path to empty .seq file' into table external_table_name partition (part='value')
OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask
P.S. In the case of loading .seq file of the size > 0M or loading that emty file via Spark I haven't any problem.
impala v0.17.0
pyhive '0.6.4'
Spark 2.3.0.2.6.5.0-292
Maybe, there is any setting in pyhive to fix this problem?
More detailed response from beeline
ERROR : Failed with exception Wrong file format. Please check the file's format.
org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165) at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTas
k (state=08S01,code=1)
was:
Whether using pyhive or impala, I'v got the same error when loading data to external table via
LOAD DATA INPATH 'path to empty .seq file' into table external_table_name partition (part='value')
OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask
P.S. In the case of loading .seq file of the size > 0M or loading that emty file via Spark I haven't any problem.
impala v0.17.0
pyhive '0.6.4'
Spark 2.3.0.2.6.5.0-292
Maybe, there is any setting in pyhive to fix this problem via?
More detailed response from beeline
ERROR : Failed with exception Wrong file format. Please check the file's format.
org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165) at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTas
k (state=08S01,code=1)
> LOAD DATA INPATH can't load data from empty Sequence File (.seq)
> ----------------------------------------------------------------
>
> Key: HIVE-25954
> URL: https://issues.apache.org/jira/browse/HIVE-25954
> Project: Hive
> Issue Type: Bug
> Components: API
> Reporter: Vladislav
> Priority: Major
>
> Whether using pyhive or impala, I'v got the same error when loading data to external table via
> LOAD DATA INPATH 'path to empty .seq file' into table external_table_name partition (part='value')
> OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask
> P.S. In the case of loading .seq file of the size > 0M or loading that emty file via Spark I haven't any problem.
> impala v0.17.0
> pyhive '0.6.4'
> Spark 2.3.0.2.6.5.0-292
>
> Maybe, there is any setting in pyhive to fix this problem?
> More detailed response from beeline
> ERROR : Failed with exception Wrong file format. Please check the file's format.
> org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
> at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
> at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
> at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
> at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
> at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
> at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
> at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
> at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165) at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
> at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
> at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:422)
> at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
> at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTas
> k (state=08S01,code=1)
--
This message was sent by Atlassian Jira
(v8.20.1#820001)