You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@slider.apache.org by "Steve Loughran (JIRA)" <ji...@apache.org> on 2014/11/01 16:03:33 UTC
[jira] [Commented] (SLIDER-599) When application is created as user
hdfs need to call destroy twice to delete the hdfs folder for the app
[ https://issues.apache.org/jira/browse/SLIDER-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14193212#comment-14193212 ]
Steve Loughran commented on SLIDER-599:
---------------------------------------
There shouldn't be anything special about the user "hdfs" that is causing this...delete should be the same for all. And if the directory delete fails, there should be a a warning printed.
It looks like the subdir {{hdfs://c6403.ambari.apache.org:8020/user/hdfs/.slider/cluster/cl1/database}} is present at create time
If this happens repeatedly, can the dfs -ls command be issued *after* the first destroy, and before the attempt to recreate the directory is made. then do a full ls/ of the contents of the {{database}} subdir, if present.
There's a possibility that this is some race condition after the teardown of the previous instance...something was running which added something under the database/ path, including creating all the parents. If this is the case, then rejecting the create is possibly the correct action: there's data there that needs to be looked at, or an explicit decision to delete it should be made.
> When application is created as user hdfs need to call destroy twice to delete the hdfs folder for the app
> ---------------------------------------------------------------------------------------------------------
>
> Key: SLIDER-599
> URL: https://issues.apache.org/jira/browse/SLIDER-599
> Project: Slider
> Issue Type: Bug
> Components: client
> Affects Versions: Slider 0.50
> Reporter: Sumit Mohanty
> Assignee: Steve Loughran
> Fix For: Slider 0.60
>
>
> It was also reported by another user. This is not a critical issue as it is not expected that application be created as user "hdfs".
> Assigning to check if there is any other issue hiding behind this symptom.
> {noformat}
> [hdfs@c6403 bin]$ ./slider destroy cl1
> 2014-11-01 04:34:06,112 [main] INFO impl.TimelineClientImpl - Timeline service address: http://c6403.ambari.apache.org:8188/ws/v1/timeline/
> 2014-11-01 04:34:07,161 [main] WARN shortcircuit.DomainSocketFactory - The short-circuit local reads feature cannot be used because libhadoop cannot be loaded.
> 2014-11-01 04:34:07,172 [main] INFO client.RMProxy - Connecting to ResourceManager at c6403.ambari.apache.org/192.168.64.103:8050
> 2014-11-01 04:34:07,516 [main] INFO zk.BlockingZKWatcher - waiting for ZK event
> 2014-11-01 04:34:07,568 [main-EventThread] INFO zk.BlockingZKWatcher - ZK binding callback received
> 2014-11-01 04:34:07,572 [main] INFO client.SliderClient - Deleting zookeeper path /services/slider/users/hdfs/cl1
> 2014-11-01 04:34:07,852 [main] INFO imps.CuratorFrameworkImpl - Starting
> 2014-11-01 04:34:07,942 [main-EventThread] INFO state.ConnectionStateManager - State change: CONNECTED
> 2014-11-01 04:34:07,943 [ConnectionStateManager-0] WARN state.ConnectionStateManager - There are no ConnectionStateListeners registered.
> 2014-11-01 04:34:08,969 [main] INFO client.SliderClient - Destroyed cluster cl1
> 2014-11-01 04:34:08,977 [main] INFO util.ExitUtil - Exiting with status 0
> {noformat}
> {noformat}
> [hdfs@c6403 bin]$ ./slider create cl1 --template /usr/work/hbase/appConfig.json --resources /usr/work/hbase/resources.json
> 2014-11-01 04:35:12,816 [main] INFO impl.TimelineClientImpl - Timeline service address: http://c6403.ambari.apache.org:8188/ws/v1/timeline/
> 2014-11-01 04:35:13,561 [main] WARN shortcircuit.DomainSocketFactory - The short-circuit local reads feature cannot be used because libhadoop cannot be loaded.
> 2014-11-01 04:35:13,568 [main] INFO client.RMProxy - Connecting to ResourceManager at c6403.ambari.apache.org/192.168.64.103:8050
> 2014-11-01 04:35:14,028 [main] INFO zk.BlockingZKWatcher - waiting for ZK event
> 2014-11-01 04:35:14,052 [main-EventThread] INFO zk.BlockingZKWatcher - ZK binding callback received
> 2014-11-01 04:35:14,063 [main] INFO agent.AgentClientProvider - Validating app definition .slider/package/HBASE/slider-hbase-app-package-0.98.4.2.2.0.0-1623-hadoop2.zip
> 2014-11-01 04:35:14,064 [main] INFO agent.AgentUtils - Reading metainfo at .slider/package/HBASE/slider-hbase-app-package-0.98.4.2.2.0.0-1623-hadoop2.zip
> 2014-11-01 04:35:14,299 [main] INFO tools.SliderUtils - Reading metainfo.xml of size 6909
> 2014-11-01 04:35:14,447 [main] ERROR tools.CoreFileSystem - Dir hdfs://c6403.ambari.apache.org:8020/user/hdfs/.slider/cluster/cl1 exists: hdfs://c6403.ambari.apache.org:8020/user/hdfs/.slider/cluster/cl1/database 0
> 2014-11-01 04:35:14,448 [main] ERROR main.ServiceLauncher - Application Instance dir already exists: hdfs://c6403.ambari.apache.org:8020/user/hdfs/.slider/cluster/cl1
> 2014-11-01 04:35:14,450 [main] INFO util.ExitUtil - Exiting with status 75
> {noformat}
> {noformat}
> [hdfs@c6403 bin]$ hdfs dfs -ls /user/hdfs/.slider/cluster
> Found 1 items
> drwxr-xr-x - hdfs hdfs 0 2014-11-01 04:34 /user/hdfs/.slider/cluster/cl1
> {noformat}
> {noformat}
> [hdfs@c6403 bin]$ ./slider destroy cl1
> 2014-11-01 04:37:25,003 [main] INFO impl.TimelineClientImpl - Timeline service address: http://c6403.ambari.apache.org:8188/ws/v1/timeline/
> 2014-11-01 04:37:25,682 [main] WARN shortcircuit.DomainSocketFactory - The short-circuit local reads feature cannot be used because libhadoop cannot be loaded.
> 2014-11-01 04:37:25,692 [main] INFO client.RMProxy - Connecting to ResourceManager at c6403.ambari.apache.org/192.168.64.103:8050
> 2014-11-01 04:37:25,965 [main] INFO zk.BlockingZKWatcher - waiting for ZK event
> 2014-11-01 04:37:25,989 [main-EventThread] INFO zk.BlockingZKWatcher - ZK binding callback received
> 2014-11-01 04:37:25,993 [main] INFO client.SliderClient - Deleting zookeeper path /services/slider/users/hdfs/cl1
> 2014-11-01 04:37:26,037 [main] INFO imps.CuratorFrameworkImpl - Starting
> 2014-11-01 04:37:26,099 [main-EventThread] INFO state.ConnectionStateManager - State change: CONNECTED
> 2014-11-01 04:37:26,100 [ConnectionStateManager-0] WARN state.ConnectionStateManager - There are no ConnectionStateListeners registered.
> 2014-11-01 04:37:27,107 [main] INFO client.SliderClient - Destroyed cluster cl1
> 2014-11-01 04:37:27,109 [main] INFO util.ExitUtil - Exiting with status 0
> {noformat}
> {noformat}
> [hdfs@c6403 bin]$ hdfs dfs -ls /user/hdfs/.slider/cluster
> [hdfs@c6403 bin]$
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)