# troubleshooting
s
io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: POST at: <https://172.20.0.1:443/apis/batch/v1/namespaces/druid/jobs>. Message: Job.batch "indexkinesiscshtest267a7d9eb175361oeepckk" is invalid: spec.template.spec.restartPolicy: Required value: valid values: "OnFailure", "Never". Received status: Status(apiVersion=v1, code=422, details=StatusDetails(causes=[StatusCause(field=spec.template.spec.restartPolicy, message=Required value: valid values: "OnFailure", "Never", reason=FieldValueRequired, additionalProperties={})], group=batch, kind=Job, name=indexkinesiscshtest267a7d9eb175361oeepckk, retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, message=Job.batch "indexkinesiscshtest267a7d9eb175361oeepckk" is invalid: spec.template.spec.restartPolicy: Required value: valid values: "OnFailure", "Never", metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=Invalid, status=Failure, additionalProperties={}).
	at io.fabric8.kubernetes.client.KubernetesClientException.copyAsCause(KubernetesClientException.java:238) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.waitForResult(OperationSupport.java:538) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handleResponse(OperationSupport.java:558) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handleCreate(OperationSupport.java:349) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.handleCreate(BaseOperation.java:711) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.handleCreate(BaseOperation.java:93) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.CreateOnlyResourceOperation.create(CreateOnlyResourceOperation.java:42) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.create(BaseOperation.java:1113) ~[?:?]
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.create(BaseOperation.java:93) ~[?:?]
	at org.apache.druid.k8s.overlord.common.DruidKubernetesPeonClient.lambda$launchJobAndWaitForStart$2(DruidKubernetesPeonClient.java:77) ~[?:?]
	at org.apache.druid.k8s.overlord.common.DruidKubernetesClient.executeRequest(DruidKubernetesClient.java:46) ~[?:?]
	at org.apache.druid.k8s.overlord.common.DruidKubernetesPeonClient.launchJobAndWaitForStart(DruidKubernetesPeonClient.java:76) ~[?:?]
	at org.apache.druid.k8s.overlord.KubernetesTaskRunner.lambda$run$0(KubernetesTaskRunner.java:169) ~[?:?]
	at java.util.concurrent.FutureTask.run(FutureTask.java:264) ~[?:?]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) ~[?:?]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) ~[?:?]
	at java.lang.Thread.run(Thread.java:829) ~[?:?]
I had adapted the test baseTemplate to the following:
apiVersion: v1
kind: PodTemplate
metadata:
  name: druid
template:
  spec:
    containers:
      - image: myregistr/druid:26.0.0_fb97cfac
        name: mydruidjob
        args:
          - /peon.sh /druid/data/persistent/task 1 --loadBroadcastSegments true
        command:
          - sh
          - -c
        resources:
          limits:
            cpu: '500m'
          requests:
            cpu: '500m'
This results in the stack trace above.
k
I don’t know anything about this stuff, but it seems like it wants a restartPolicy?
spec.template.spec.restartPolicy: Required value: valid values: "OnFailure", "Never"
Have you tried adding something like this?
template:
  spec:
    restartPolicy: OnFailure
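For reference, here is a minimal sketch of the template from above with the suggested field merged in. Only the restartPolicy line is new. OnFailure is taken from the suggestion, and Never is the other value the API server lists as valid in the error message. Whether this alone yields a working peon pod is not confirmed in this thread.

```yaml
apiVersion: v1
kind: PodTemplate
metadata:
  name: druid
template:
  spec:
    # Required for pods created by a Job; the error lists "OnFailure" and "Never" as valid.
    # OnFailure is an assumption carried over from the suggestion above.
    restartPolicy: OnFailure
    containers:
      - image: myregistr/druid:26.0.0_fb97cfac
        name: mydruidjob
        args:
          - /peon.sh /druid/data/persistent/task 1 --loadBroadcastSegments true
        command:
          - sh
          - -c
        resources:
          limits:
            cpu: '500m'
          requests:
            cpu: '500m'
```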
s
The Overlord is now trying to create K8s Jobs (but fails). I will get to a running setup here soon. Thanks a lot @Kyle Hoondert. My misunderstanding was that I thought I only had to provide a kind of spec snippet and Druid would take care of the launching, rather than having to create a fully working pod template spec into which Druid just injects the task via environment variables. That was a great help.