Hi Folks , Do we have metrics on flink checkpointing to track the reason for failure ?
At present there seems to be a metric for the failure as a whole , but was looking for a more granular metric ( on the type for failure ) ?
Use case : To compare the flink jobs checkpointing across underlying different file systems