Akash Patel
08/29/2024, 4:30 PM
Arthur Catrisse
08/30/2024, 12:29 PM
FlinkDeployment
and did set
high-availability.type: kubernetes
high-availability.storageDir: s3://my-bucket/recovery
kubernetes.jobmanager.replicas: "2"
(link to our other issue)
Akash Patel
08/30/2024, 1:46 PM
Arthur Catrisse
09/04/2024, 2:34 PM
About HA: it is beneficial to use it in production so that the job keeps track of its checkpoints and can recover from the last checkpoint automatically.
The Flink docs may not state this clearly for Kubernetes HA, but in the K8s HA case only 1 JM pod exists by default, and no standby JM pods are created unless kubernetes.jobmanager.replicas is set above 1.
HA here works only as a job store: the information about the latest snapshot is passed to the newly created JM pod if the existing one dies, so the job recovers automatically.
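For reference, the HA settings discussed in this thread could sit in a FlinkDeployment roughly like the sketch below. It is only an illustration, assuming the Flink Kubernetes Operator and Flink 1.17 or later (where the `high-availability.type` key is available). The deployment name, image, service account, checkpoint directory, resource values and jar path are hypothetical placeholders, not taken from the thread.

```yaml
# Minimal sketch of a FlinkDeployment with Kubernetes HA enabled.
# All names, paths and resource sizes below are illustrative assumptions.
apiVersion: flink.apache.org/v1beta1
kind: FlinkDeployment
metadata:
  name: ha-example                      # hypothetical name
spec:
  image: flink:1.17                     # assumed image
  flinkVersion: v1_17
  serviceAccount: flink                 # assumed service account with ConfigMap permissions
  flinkConfiguration:
    # HA metadata (pointers to the latest checkpoint) is kept via Kubernetes,
    # while the recovery data itself is written to the storage directory.
    high-availability.type: kubernetes
    high-availability.storageDir: s3://my-bucket/recovery
    state.checkpoints.dir: s3://my-bucket/checkpoints   # hypothetical path
  jobManager:
    replicas: 1                         # a single JM; it is recreated on failure
    resource:
      memory: "2048m"
      cpu: 1
  taskManager:
    resource:
      memory: "2048m"
      cpu: 1
  job:
    jarURI: local:///opt/flink/usrlib/my-job.jar        # hypothetical jar
    parallelism: 2
    upgradeMode: last-state             # restore from the latest HA checkpoint metadata
```

With a setup like this, a replacement JM pod reads the HA metadata and resumes the job from the last completed checkpoint, which is the recovery behaviour described above.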