You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/08/21 14:57:00 UTC

[jira] [Commented] (IGNITE-9313) ML TF integration: killed user script or chief processes didn't restart workers

    [ https://issues.apache.org/jira/browse/IGNITE-9313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16587546#comment-16587546 ] 

ASF GitHub Bot commented on IGNITE-9313:
----------------------------------------

Github user asfgit closed the pull request at:

    https://github.com/apache/ignite/pull/4581


>  ML TF integration: killed user script or chief processes didn't restart workers
> --------------------------------------------------------------------------------
>
>                 Key: IGNITE-9313
>                 URL: https://issues.apache.org/jira/browse/IGNITE-9313
>             Project: Ignite
>          Issue Type: Bug
>          Components: ml
>    Affects Versions: 2.7
>            Reporter: Stepan Pilschikov
>            Assignee: Anton Dmitriev
>            Priority: Major
>              Labels: tf-integration
>             Fix For: 2.7
>
>
> Case:
>  * Run cluster
>  * Filling caches with data
>  * Running python script
>  * Killing user script or chief
> Expected: 
> - chief and user script processes shutdown and run again on same node (-)
> - rerun user script (-) (+)
> - directory with metadata was deleted and created new one in /tmp (-)
> Actual:
> - chief or user script shutting down and run again
> - all workers still running and didn't restart
> - directory with metadata (/tmp/tf_us_*) not deleted
> - new directory with metadata is not created after restart
> - user script did not rerun after 'chief process' killing ('user_script' process killing restarting script execution)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)