You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@submarine.apache.org by GitBox <gi...@apache.org> on 2021/02/11 07:01:58 UTC

[GitHub] [submarine] kevin85421 opened a new pull request #510: SUBMARINE-728. The experiment example will cause OOM error.

kevin85421 opened a new pull request #510:
URL: https://github.com/apache/submarine/pull/510


   ### What is this PR for?
   The experiment example will cause OOM error.
   
   
   ### What type of PR is it?
   [Improvement]
   
   ### Todos
   
   ### What is the Jira issue?
   https://issues.apache.org/jira/projects/SUBMARINE/issues/SUBMARINE-728?filter=myopenissues
   
   ### How should this be tested?
   * Step1: Create an experiment
   ```
   curl -X POST -H "Content-Type: application/json" -d '
   {
     "meta": {
       "name": "tf-mnist-json",
       "namespace": "default",
       "framework": "TensorFlow",
       "cmd": "python /var/tf_mnist/mnist_with_summaries.py --log_dir=/train/log --learning_rate=0.01 --batch_size=150",
       "envVars": {
         "ENV_1": "ENV1"
       }
     },
     "environment": {
       "image": "apache/submarine:tf-mnist-with-summaries-1.0"
     },
     "spec": {
       "Ps": {
         "replicas": 1,
         "resources": "cpu=1,memory=1024M"
       },
       "Worker": {
         "replicas": 1,
         "resources": "cpu=1,memory=2048M"
       }
     }
   }
   ' http://127.0.0.1:8080/api/v1/experiment
   ```
   * Step2: Check the status of the Pods
   ```
   kubectl get pods
   ```
   ### Screenshots (if appropriate)
   
   ### Questions:
   * Does the licenses files need update? No
   * Is there breaking changes for older versions? No
   * Does this needs documentation? No
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [submarine] asfgit closed pull request #510: SUBMARINE-728. The experiment example will cause OOM error.

Posted by GitBox <gi...@apache.org>.
asfgit closed pull request #510:
URL: https://github.com/apache/submarine/pull/510


   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org