You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@yunikorn.apache.org by "Keep Cooking Life (Jira)" <ji...@apache.org> on 2023/03/13 08:57:00 UTC

[jira] [Created] (YUNIKORN-1631) UI doesn't show the pytorchjob in the application list

Keep Cooking Life created YUNIKORN-1631:
-------------------------------------------

             Summary: UI doesn't show the pytorchjob in the application list 
                 Key: YUNIKORN-1631
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-1631
             Project: Apache YuniKorn
          Issue Type: Bug
          Components: webapp
         Environment: yunikorn1.2.0 kubenetes 1.23.16 ubuntu 22.04
            Reporter: Keep Cooking Life
             Fix For: 1.2.0
         Attachments: image-2023-03-13-16-34-47-769.png, unnamed (1).png, unnamed (2).png

I use this website to install kubeflow https://yunikorn.apache.org/docs/user_guide/workloads/run_tf/

when I  had set kubeflow application  , I watched the website [http://localhost:9889|http://localhost:9889/] and I notice that the application yunikorn-kubeflow-autogen didn't appear.

the command I applied  below
wget [https://raw.githubusercontent.com/kubeflow/training-operator/master/examples/pytorch/simple.yaml]
kubectl apply -k "[github.com/kubeflow/training-operator/manifests/overlays/standalone?ref=v1.5.0|http://github.com/kubeflow/training-operator/manifests/overlays/standalone?ref=v1.5.0]"
kubectl apply -f simple.yaml
the yaml I used below:

apiVersion: "[kubeflow.org/v1|http://kubeflow.org/v1]"
kind: PyTorchJob
metadata:
  name: pytorch-simple
  namespace: kubeflow
spec:
  pytorchReplicaSpecs:
    Master:
      replicas: 1
      restartPolicy: OnFailure
      template:
        spec:
          schedulerName: yunikorn
          containers:
            - name: pytorch
              image: [docker.io/kubeflowkatib/pytorch-mnist:v1beta1-45c5727|http://docker.io/kubeflowkatib/pytorch-mnist:v1beta1-45c5727]
              imagePullPolicy: Always
              command:
                - "python3"
                - "/opt/pytorch-mnist/mnist.py"
                - "--epochs=1"
    Worker:
      replicas: 1
      restartPolicy: OnFailure
      template:
        spec:
          schedulerName: yunikorn
          containers:
            - name: pytorch
              image: [docker.io/kubeflowkatib/pytorch-mnist:v1beta1-45c5727|http://docker.io/kubeflowkatib/pytorch-mnist:v1beta1-45c5727]
              imagePullPolicy: Always
              command:
                - "python3"
                - "/opt/pytorch-mnist/mnist.py"
                - "--epochs=1"



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@yunikorn.apache.org
For additional commands, e-mail: dev-help@yunikorn.apache.org