Slackbot
10/13/2022, 2:28 AMXipeng Guan
10/13/2022, 2:40 AMBenjamin Tan
10/13/2022, 2:43 AMapiVersion: <http://rbac.authorization.k8s.io/v1|rbac.authorization.k8s.io/v1>
kind: Role
metadata:
name: yatai-role
namespace: ds-models
rules:
- apiGroups:
- ""
- <http://serving.yatai.ai|serving.yatai.ai>
- <http://networking.k8s.io|networking.k8s.io>
resources:
- pods
- namespaces
- bentodeployments
- ingresses
verbs:
- get
- watch
- list
- create
- update
kubectl create rolebinding yatai-role-binding --user=system:serviceaccount:yatai-system:yatai --role=yatai-role -n ds-models
Benjamin Tan
10/13/2022, 2:45 AM2022-10-13T01:59:08.628374348Z Downloading bento ktp_ocr:uipktisj5sjyzz3e tar file from <http://yatai.yatai-system.svc.cluster.local/api/v1/bento_repositories/ktp_ocr/bentos/uipktisj5sjyzz3e/download> to /tmp/downloaded.tar...
2022-10-13T01:59:08.781018745Z curl: (22) The requested URL returned error: 500
Benjamin Tan
10/13/2022, 2:48 AMyatai.yatai-system.svc.cluster.local
to my host then I can hit the download the bentoBenjamin Tan
10/13/2022, 3:00 AMXipeng Guan
10/13/2022, 3:01 AMhelm upgrade
Benjamin Tan
10/13/2022, 3:01 AMBenjamin Tan
10/13/2022, 3:01 AMBenjamin Tan
10/13/2022, 3:02 AMXipeng Guan
10/13/2022, 3:04 AMBenjamin Tan
10/13/2022, 3:07 AMBenjamin Tan
10/13/2022, 3:08 AMbentoDeploymentNamespaces: ['yatai']
?Xipeng Guan
10/13/2022, 3:08 AMBenjamin Tan
10/13/2022, 3:14 AMhelm upgrade yatai-deployment bentoml/yatai-deployment -n yatai-deployment --set bentoDeploymentNamespaces=<NEW NAMESPACE> --devel
Benjamin Tan
10/13/2022, 3:16 AMdeploy deployment revision: failed to deploy kube bento deployment: failed to get kube bento deployment: conversion webhook for <http://serving.yatai.ai/v1alpha3|serving.yatai.ai/v1alpha3>, Kind=BentoDeployment failed: no kind "BentoDeployment" is registered for version "<http://serving.yatai.ai/v1alpha3|serving.yatai.ai/v1alpha3>" in scheme "pkg/runtime/scheme.go:100"
Xipeng Guan
10/13/2022, 3:23 AMBenjamin Tan
10/13/2022, 3:27 AMBenjamin Tan
10/13/2022, 3:27 AMXipeng Guan
10/13/2022, 3:37 AMkubectl api-resources | grep bento
Benjamin Tan
10/13/2022, 3:37 AMbentodeployments <http://serving.yatai.ai/v1alpha3|serving.yatai.ai/v1alpha3> true BentoDeployment
Xipeng Guan
10/13/2022, 3:38 AMBenjamin Tan
10/13/2022, 3:38 AMk rollout restart deploy yatai-deployment -n yatai-deployment
Benjamin Tan
10/13/2022, 3:38 AMXipeng Guan
10/13/2022, 3:38 AMBenjamin Tan
10/13/2022, 3:39 AMXipeng Guan
10/13/2022, 3:40 AMk -n yatai-deployment get pod
Benjamin Tan
10/13/2022, 3:40 AMBenjamin Tan
10/13/2022, 3:40 AMBenjamin Tan
10/13/2022, 3:40 AMyatai-deployment-7fc55f647b-zmv4m 1/1 Running 0 14m
yatai-deployment-default-domain-wmfbp 0/1 Completed 0 11h
Benjamin Tan
10/13/2022, 3:42 AMXipeng Guan
10/13/2022, 3:43 AMk delete pod ... --force --grace-period=0
Xipeng Guan
10/13/2022, 3:43 AMBenjamin Tan
10/13/2022, 3:44 AMBenjamin Tan
10/13/2022, 3:45 AMXipeng Guan
10/13/2022, 3:45 AMkubectl -n yatai-deployment describe deploy yatai-deployment
Benjamin Tan
10/13/2022, 3:46 AMBenjamin Tan
10/13/2022, 3:47 AMNormal ScalingReplicaSet 39m deployment-controller Scaled up replica set yatai-deployment-5c79585dff to 1
Normal ScalingReplicaSet 38m deployment-controller Scaled down replica set yatai-deployment-9d7f5974f to 0
Normal ScalingReplicaSet 32m deployment-controller Scaled up replica set yatai-deployment-7c84c88f to 1
Normal ScalingReplicaSet 31m deployment-controller Scaled down replica set yatai-deployment-5c79585dff to 0
Normal ScalingReplicaSet 29m deployment-controller Scaled up replica set yatai-deployment-fbf9f44f7 to 1
Normal ScalingReplicaSet 28m deployment-controller Scaled down replica set yatai-deployment-7c84c88f to 0
Normal ScalingReplicaSet 25m deployment-controller Scaled up replica set yatai-deployment-6f688cc494 to 1
Normal ScalingReplicaSet 25m deployment-controller Scaled down replica set yatai-deployment-fbf9f44f7 to 0
Normal ScalingReplicaSet 20m deployment-controller Scaled up replica set yatai-deployment-7fc55f647b to 1
Normal ScalingReplicaSet 20m deployment-controller (combined from similar events): Scaled down replica set yatai-depl
Benjamin Tan
10/13/2022, 3:48 AMdeploy deployment revision: failed to deploy kube bento deployment: failed to get kube bento deployment: conversion webhook for <http://serving.yatai.ai/v1alpha3|serving.yatai.ai/v1alpha3>, Kind=BentoDeployment failed: Post "<https://yatai-deployment-webhook-service.yatai-deployment.svc:443/convert?timeout=30s>": no endpoints available for service "yatai-deployment-webhook-service"
Benjamin Tan
10/13/2022, 3:48 AMBenjamin Tan
10/13/2022, 3:53 AMBenjamin Tan
10/13/2022, 3:53 AMBenjamin Tan
10/13/2022, 3:53 AMNormal ScalingReplicaSet 46m deployment-controller Scaled up replica set yatai-deployment-5c79585dff to 1
Normal ScalingReplicaSet 46m deployment-controller Scaled down replica set yatai-deployment-9d7f5974f to 0
Normal ScalingReplicaSet 39m deployment-controller Scaled up replica set yatai-deployment-7c84c88f to 1
Normal ScalingReplicaSet 39m deployment-controller Scaled down replica set yatai-deployment-5c79585dff to 0
Normal ScalingReplicaSet 36m deployment-controller Scaled up replica set yatai-deployment-fbf9f44f7 to 1
Normal ScalingReplicaSet 36m deployment-controller Scaled down replica set yatai-deployment-7c84c88f to 0
Normal ScalingReplicaSet 32m deployment-controller Scaled up replica set yatai-deployment-6f688cc494 to 1
Normal ScalingReplicaSet 32m deployment-controller Scaled down replica set yatai-deployment-fbf9f44f7 to 0
Normal ScalingReplicaSet 27m deployment-controller Scaled up replica set yatai-deployment-7fc55f647b to 1
Normal ScalingReplicaSet 27m deployment-controller (combined from similar events): Scaled down replica set yatai-deployment-6f688cc494 to 0
Xipeng Guan
10/13/2022, 3:54 AMk -n yatai-deployment get rs -l <http://app.kubernetes.io/name=yatai-deployment|app.kubernetes.io/name=yatai-deployment>
Benjamin Tan
10/13/2022, 4:00 AMyatai-deployment-5b77945db6 0 0 0 11h
yatai-deployment-5c79585dff 0 0 0 53m
yatai-deployment-5ff7446458 0 0 0 146m
yatai-deployment-65f898cc68 0 0 0 11h
yatai-deployment-6f688cc494 0 0 0 39m
yatai-deployment-7c84c88f 0 0 0 46m
yatai-deployment-7fc55f647b 1 1 1 34m
yatai-deployment-9d7f5974f 0 0 0 126m
yatai-deployment-d474f7bb8 0 0 0 11h
yatai-deployment-fbf9f44f7 0 0 0 43m
Xipeng Guan
10/13/2022, 4:02 AMXipeng Guan
10/13/2022, 4:03 AMk -n yatai-deployment delete rs -l <http://app.kubernetes.io/name=yatai-deployment|app.kubernetes.io/name=yatai-deployment>
Xipeng Guan
10/13/2022, 4:03 AMk -n yatai-deployment get rs -l <http://app.kubernetes.io/name=yatai-deployment|app.kubernetes.io/name=yatai-deployment>
Benjamin Tan
10/13/2022, 4:12 AMk -n yatai-deployment get rs -l <http://app.kubernetes.io/name=yatai-deployment|app.kubernetes.io/name=yatai-deployment>
Benjamin Tan
10/13/2022, 4:12 AMXipeng Guan
10/13/2022, 4:12 AMk -n yatai-deployment describe deploy yatai-deployment
Benjamin Tan
10/13/2022, 4:14 AMNormal ScalingReplicaSet 60m deployment-controller Scaled up replica set yatai-deployment-7c84c88f to 1
Normal ScalingReplicaSet 60m deployment-controller Scaled down replica set yatai-deployment-5c79585dff to 0
Normal ScalingReplicaSet 57m deployment-controller Scaled up replica set yatai-deployment-fbf9f44f7 to 1
Normal ScalingReplicaSet 57m deployment-controller Scaled down replica set yatai-deployment-7c84c88f to 0
Normal ScalingReplicaSet 53m deployment-controller Scaled up replica set yatai-deployment-6f688cc494 to 1
Normal ScalingReplicaSet 53m deployment-controller Scaled down replica set yatai-deployment-fbf9f44f7 to 0
Normal ScalingReplicaSet 48m deployment-controller Scaled up replica set yatai-deployment-7fc55f647b to 1
Normal ScalingReplicaSet 48m deployment-controller (combined from similar events): Scaled down replica set yatai-deployment-6f688cc494 to 0
Xipeng Guan
10/13/2022, 4:16 AMXipeng Guan
10/13/2022, 4:19 AMk -n yatai-deployment rollout status deploy yatai-deployment
Xipeng Guan
10/13/2022, 4:23 AMk get events --sort-by=.metadata.creationTimestamp
Benjamin Tan
10/13/2022, 4:34 AMBenjamin Tan
10/13/2022, 4:34 AMBenjamin Tan
10/13/2022, 4:35 AMyatai-builders
namespaceBenjamin Tan
10/13/2022, 4:36 AMXipeng Guan
10/13/2022, 4:37 AMyatai-builders
Benjamin Tan
10/13/2022, 4:38 AMyatai-builders
namespaceBenjamin Tan
10/13/2022, 4:39 AMk -n yatai-deployment rollout status deploy yatai-deployment
W1013 12:38:59.380507 65788 gcp.go:119] WARNING: the gcp auth plugin is deprecated in v1.22+, unavailable in v1.26+; use gcloud instead.
To learn more, consult <https://cloud.google.com/blog/products/containers-kubernetes/kubectl-auth-changes-in-gke>
Waiting for deployment spec update to be observed...
Benjamin Tan
10/13/2022, 4:42 AMk get events --sort-by=.metadata.creationTimestamp -n yatai-deployment
W1013 12:41:59.391593 65954 gcp.go:119] WARNING: the gcp auth plugin is deprecated in v1.22+, unavailable in v1.26+; use gcloud instead.
To learn more, consult <https://cloud.google.com/blog/products/containers-kubernetes/kubectl-auth-changes-in-gke>
LAST SEEN TYPE REASON OBJECT MESSAGE
58m Warning Unhealthy pod/yatai-deployment-7fc55f647b-zmv4m Liveness probe failed: Get "<http://10.65.128.154:8081/healthz>": dial tcp 10.65.128.154:8081: connect: connection refused
58m Warning Unhealthy pod/yatai-deployment-7fc55f647b-zmv4m Readiness probe failed: Get "<http://10.65.128.154:8081/readyz>": dial tcp 10.65.128.154:8081: connect: connection refused
56m Normal Killing pod/yatai-deployment-7fc55f647b-zmv4m Stopping container manager
54m Warning FailedMount pod/yatai-deployment-7fc55f647b-zmv4m Unable to attach or mount volumes: unmounted volumes=[cert kube-api-access-gtg56], unattached volumes=[cert kube-api-access-gtg56]: timed out waiting for the condition
54m Warning FailedMount pod/docker-private-registry-proxy-dg9fd MountVolume.SetUp failed for volume "kube-api-access-92k5b" : failed to sync configmap cache: timed out waiting for the condition
52m Normal Killing pod/docker-private-registry-proxy-7pg8n Stopping container tcp-proxy
22m Warning FailedMount pod/docker-private-registry-proxy-9782x MountVolume.SetUp failed for volume "kube-api-access-45xn6" : failed to sync configmap cache: timed out waiting for the condition
15m Normal Killing pod/docker-private-registry-proxy-t6kk7 Stopping container tcp-proxy
14m Warning NetworkNotReady pod/docker-private-registry-proxy-t6kk7 network is not ready: container runtime network not ready: NetworkReady=false reason:NetworkPluginNotReady message:docker: network plugin is not ready: cni config uninitialized
13m Normal SandboxChanged pod/docker-private-registry-proxy-t6kk7 Pod sandbox changed, it will be killed and re-created.
14m Warning FailedCreatePodSandBox pod/docker-private-registry-proxy-t6kk7 Failed to create pod sandbox: rpc error: code = Unknown desc = failed to set up sandbox container "89b5f7db5744a81af886b37b60c88b9aa8bc9bde0fab09b5a15d2858c6774697" network for pod "docker-private-registry-proxy-t6kk7": networkPlugin cni failed to set up pod "docker-private-registry-proxy-t6kk7_yatai-deployment" network: stat /var/lib/calico/nodename: no such file or directory: check that the calico/node container is running and has mounted /var/lib/calico/
13m Warning FailedCreatePodSandBox pod/docker-private-registry-proxy-t6kk7 Failed to create pod sandbox: rpc error: code = Unknown desc = failed to set up sandbox container "778bdc2d6b584580a3eac1bcef1d62fb15e20aa69c80084fafce471a37545a36" network for pod "docker-private-registry-proxy-t6kk7": networkPlugin cni failed to set up pod "docker-private-registry-proxy-t6kk7_yatai-deployment" network: stat /var/lib/calico/nodename: no such file or directory: check that the calico/node container is running and has mounted /var/lib/calico/
13m Normal Pulling pod/docker-private-registry-proxy-t6kk7 Pulling image "<http://quay.io/bentoml/proxy-to-service:v2|quay.io/bentoml/proxy-to-service:v2>"
13m Normal Pulled pod/docker-private-registry-proxy-t6kk7 Successfully pulled image "<http://quay.io/bentoml/proxy-to-service:v2|quay.io/bentoml/proxy-to-service:v2>" in 7.769060403s
13m Normal Created pod/docker-private-registry-proxy-t6kk7 Created container tcp-proxy
13m Normal Started pod/docker-private-registry-proxy-t6kk7 Started container tcp-proxy