Hi :wave: I am using the flink-operator 1.9.0 on E...
# troubleshooting
j
Hi šŸ‘‹ I am using the flink-operator 1.9.0 on EKS (with karpenter - on-demand instances). I can't isolate the behaviour, but when I am deploying one single job... some others jobs are impacted, returning:
Copy code
2024-07-15 11:37:54
org.apache.flink.util.FlinkExpectedException: The TaskExecutor is shutting down.
	at org.apache.flink.runtime.taskexecutor.TaskExecutor.onStop(TaskExecutor.java:481)
	at org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStop(RpcEndpoint.java:239)
the job (eventually) recovers pretty fast by itself
for the ones dealing with this issue.... yes, karpenter consolidation creates this behaviour (pods are killed with this exception while the cluster optimizes the nodes structure). Disabling will fix it