You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Chun-Hung Hsiao (JIRA)" <ji...@apache.org> on 2018/11/21 19:04:00 UTC

[jira] [Assigned] (MESOS-9109) Windows agent uses reserved character :(colon) for file name and crashes when attempting to remove link

     [ https://issues.apache.org/jira/browse/MESOS-9109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chun-Hung Hsiao reassigned MESOS-9109:
--------------------------------------

    Assignee:     (was: Chun-Hung Hsiao)

> Windows agent uses reserved character :(colon) for file name and crashes when attempting to remove link
> -------------------------------------------------------------------------------------------------------
>
>                 Key: MESOS-9109
>                 URL: https://issues.apache.org/jira/browse/MESOS-9109
>             Project: Mesos
>          Issue Type: Bug
>          Components: agent
>    Affects Versions: 1.7.0
>            Reporter: Constantin Eduard Staniloiu
>            Priority: Blocker
>              Labels: windows
>
> I have a hybrid cluster running Mesos Agents on Windows, and I am using Chronos to launch jobs on Windows Agents.
> Chronos is using the character : (colon) internally when spawning jobs. The Windows Mesos Agent spawns those jobs and creates the paths on disk, but when the job terminates and it attempts to remove the link it crashes with the following error message 
>   
> {code:java}
> I0719 09:20:00.621385 14788 gc.cpp:129] Unscheduling 'D:\ws\mes-wd\meta\slaves\5563b512-518e-44c6-bdc1-3c927d0622da-S1\frameworks\77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000\executors\ct:1532006400000:0
> :sample-child-job-lv2:' from gc
> I0719 09:20:00.622387 24124 slave.cpp:2406] Authorizing task 'ct:1532006400000:0:sample-child-job2:' for framework 77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000
> I0719 09:20:00.630340 24124 slave.cpp:2406] Authorizing task 'ct:1532006400000:0:sample-child-job-lv2:' for framework 77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000
> I0719 09:20:00.644341 24124 slave.cpp:2849] Launching task 'ct:1532006400000:0:sample-child-job2:' for framework 77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000
> I0719 09:20:00.649345 24124 paths.cpp:748] Creating sandbox 'D:\ws\mes-wd\slaves\5563b512-518e-44c6-bdc1-3c927d0622da-S1\frameworks\77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000\executors\ct:1532006400000
> :0:sample-child-job2:\runs\cecbf7ab-ace3-4f45-a208-9c104f69624c'
> F0719 09:20:00.653342 24124 paths.cpp:763] CHECK_SOME(os::rm(latest)): The filename, directory name, or volume label syntax is incorrect.
> Failed to remove latest symlink 'D:\ws\mes-wd\slaves\5563b512-518e-44c6-bdc1-3c927d0622da-S1\frameworks\77a0fb6f-3c43-4d7b-ae16-af2dfd728567-0000\executors\ct:1532006400000:0:sample-child-job2:\runs\
> latest'
> *** Check failure stack trace: ***
> {code}
>  
> The problem seems to be the job name: 
> {code:java}
> 'ct:1532006400000:0:sample-child-job2:'
> {code}
> Chronos is using internally : (colon) which is a reserved character on Windows [https://docs.microsoft.com/en-us/windows/desktop/FileIO/naming-a-file|http://example.com/] 
> I believe it's the responsibility of the agent to check and sanitize the task names against restricted characters.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)