You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Vladislav (Jira)" <ji...@apache.org> on 2022/02/11 13:03:00 UTC

[jira] [Updated] (HIVE-25954) LOAD DATA INPATH can't load data from empty Sequence File (.seq)

     [ https://issues.apache.org/jira/browse/HIVE-25954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vladislav updated HIVE-25954:
-----------------------------
    Description: 
Whether using pyhive or impala, I'v got the same error when loading data to external table via 

LOAD DATA INPATH 'path to empty .seq file' into table external_table_name partition (part='value')

OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask

P.S. In the case of loading .seq file of the size > 0M or loading that emty file via Spark I haven't any problem.

impala v0.17.0

pyhive '0.6.4'

Spark  2.3.0.2.6.5.0-292

 

Maybe, there is any setting in pyhive to fix this problem?

More detailed response from beeline 

ERROR : Failed with exception Wrong file format. Please check the file's format.
org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
        at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
        at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
        at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
        at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
        at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
        at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165)        at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
        at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
        at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
        at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)

Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTas
k (state=08S01,code=1)

  was:
Whether using pyhive or impala, I'v got the same error when loading data to external table via 

LOAD DATA INPATH 'path to empty .seq file' into table external_table_name partition (part='value')

OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask

P.S. In the case of loading .seq file of the size > 0M or loading that emty file via Spark I haven't any problem.

impala v0.17.0

pyhive '0.6.4'

Spark  2.3.0.2.6.5.0-292

 

Maybe, there is any setting in pyhive to fix this problem via?

More detailed response from beeline 

ERROR : Failed with exception Wrong file format. Please check the file's format.
org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
        at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
        at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
        at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
        at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
        at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
        at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165)        at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
        at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
        at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
        at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)

Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTas
k (state=08S01,code=1)


> LOAD DATA INPATH can't load data from empty Sequence File (.seq)
> ----------------------------------------------------------------
>
>                 Key: HIVE-25954
>                 URL: https://issues.apache.org/jira/browse/HIVE-25954
>             Project: Hive
>          Issue Type: Bug
>          Components: API
>            Reporter: Vladislav
>            Priority: Major
>
> Whether using pyhive or impala, I'v got the same error when loading data to external table via 
> LOAD DATA INPATH 'path to empty .seq file' into table external_table_name partition (part='value')
> OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask
> P.S. In the case of loading .seq file of the size > 0M or loading that emty file via Spark I haven't any problem.
> impala v0.17.0
> pyhive '0.6.4'
> Spark  2.3.0.2.6.5.0-292
>  
> Maybe, there is any setting in pyhive to fix this problem?
> More detailed response from beeline 
> ERROR : Failed with exception Wrong file format. Please check the file's format.
> org.apache.hadoop.hive.ql.metadata.HiveException: Wrong file format. Please check the file's format.
>         at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:360)
>         at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:162)
>         at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:89)
>         at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1765)
>         at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1506)
>         at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1303)
>         at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1170)
>         at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1165)        at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:197)
>         at org.apache.hive.service.cli.operation.SQLOperation.access$300(SQLOperation.java:76)
>         at org.apache.hive.service.cli.operation.SQLOperation$2$1.run(SQLOperation.java:255)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:422)
>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
>         at org.apache.hive.service.cli.operation.SQLOperation$2.run(SQLOperation.java:266)
>         at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>         at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>         at java.lang.Thread.run(Thread.java:748)
> Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTas
> k (state=08S01,code=1)



--
This message was sent by Atlassian Jira
(v8.20.1#820001)