Posted to issues@kylin.apache.org by "weibin0516 (JIRA)" <ji...@apache.org> on 2019/07/25 03:04:00 UTC
[jira] [Assigned] (KYLIN-4107) StorageCleanupJob fails to delete
Hive tables with "Argument list too long" error
[ https://issues.apache.org/jira/browse/KYLIN-4107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
weibin0516 reassigned KYLIN-4107:
---------------------------------
Assignee: weibin0516
> StorageCleanupJob fails to delete Hive tables with "Argument list too long" error
> ---------------------------------------------------------------------------------
>
> Key: KYLIN-4107
> URL: https://issues.apache.org/jira/browse/KYLIN-4107
> Project: Kylin
> Issue Type: Bug
> Components: Storage - HBase
> Affects Versions: v2.6.2
> Environment: CentOS 7.6, HDP 2.6.5, Kylin 2.6.3
> Reporter: Vsevolod Ostapenko
> Assignee: weibin0516
> Priority: Major
> Fix For: v3.0.0-beta
>
>
> On a system where multiple Kylin developers experiment with cube design and often (re)build or drop cube segments, intermediate Hive tables and leftover HBase tables accumulate very quickly.
> After a certain point, storage cleanup can no longer be executed using the suggested method:
> {{${KYLIN_HOME}/bin/kylin.sh org.apache.kylin.tool.StorageCleanupJob --delete true}}
> Apparently, the storage cleanup job creates a single shell command to drop all Hive tables, which fails to execute because the command line is too long. For example:
> {quote}
> 2019-07-23 17:47:31,611 ERROR [main] job.StorageCleanupJob:377 : Error during deleting Hive tables
> java.io.IOException: Cannot run program "/bin/bash": error=7, Argument list too long
> at java.lang.ProcessBuilder.start(ProcessBuilder.java:1048)
> at org.apache.kylin.common.util.CliCommandExecutor.runNativeCommand(CliCommandExecutor.java:133)
> at org.apache.kylin.common.util.CliCommandExecutor.execute(CliCommandExecutor.java:89)
> at org.apache.kylin.common.util.CliCommandExecutor.execute(CliCommandExecutor.java:83)
> at org.apache.kylin.rest.job.StorageCleanupJob.deleteHiveTables(StorageCleanupJob.java:409)
> at org.apache.kylin.rest.job.StorageCleanupJob.cleanUnusedIntermediateHiveTableInternal(StorageCleanupJob.java:375)
> at org.apache.kylin.rest.job.StorageCleanupJob.cleanUnusedIntermediateHiveTable(StorageCleanupJob.java:278)
> at org.apache.kylin.rest.job.StorageCleanupJob.cleanup(StorageCleanupJob.java:151)
> at org.apache.kylin.rest.job.StorageCleanupJob.execute(StorageCleanupJob.java:145)
> at org.apache.kylin.common.util.AbstractApplication.execute(AbstractApplication.java:37)
> at org.apache.kylin.tool.StorageCleanupJob.main(StorageCleanupJob.java:27)
> Caused by: java.io.IOException: error=7, Argument list too long
> at java.lang.UNIXProcess.forkAndExec(Native Method)
> at java.lang.UNIXProcess.<init>(UNIXProcess.java:247)
> at java.lang.ProcessImpl.start(ProcessImpl.java:134)
> at java.lang.ProcessBuilder.start(ProcessBuilder.java:1029)
> ... 10 more
> {quote}
> Instead of composing one long command, storage cleanup needs to generate a script and feed it into beeline or the Hive CLI.
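
The suggestion above could be sketched roughly as follows. This is a minimal illustration, not the actual Kylin fix: the class name HiveDropScriptSketch, its helper methods, and the table names are hypothetical, and it assumes a hive executable on the PATH that accepts a script file via -f. The DROP statements are written to a temporary file, so only the file path is passed on the command line, however many tables there are.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: not part of Kylin's code base.
public class HiveDropScriptSketch {

    // Builds one statement per line: a USE clause, then one DROP per table.
    public static List<String> buildDropStatements(String database, List<String> tables) {
        List<String> lines = new ArrayList<>();
        lines.add("USE " + database + ";");
        for (String table : tables) {
            lines.add("DROP TABLE IF EXISTS " + table + ";");
        }
        return lines;
    }

    // Writes the statements to a temporary script file and returns its path.
    public static Path writeScript(List<String> statements) throws IOException {
        Path script = Files.createTempFile("kylin-drop-hive-tables-", ".hql");
        Files.write(script, statements, StandardCharsets.UTF_8);
        return script;
    }

    // The command line stays short: only the script path is an argument.
    public static List<String> buildCommand(Path script) {
        return Arrays.asList("hive", "-f", script.toString());
    }

    // Runs the script; assumes "hive" is installed and on the PATH.
    public static int runScript(Path script) throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(script)).inheritIO().start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        // Table names are made up for illustration.
        List<String> tables = Arrays.asList(
                "kylin_intermediate_example_a", "kylin_intermediate_example_b");
        Path script = writeScript(buildDropStatements("default", tables));
        System.out.println(String.join(" ", buildCommand(script)));
        // Only execute when explicitly requested, since this drops tables.
        if (args.length > 0 && "--run".equals(args[0])) {
            System.exit(runScript(script));
        }
    }
}
```

The same script file could presumably be handed to beeline with its -f option, which would additionally need a JDBC connection URL for the target HiveServer2 instance.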
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)