# troubleshoot
g
Hello everyone, I deployed DataHub in GKE using Helm. I have 4 replicas of GMS and 2 replicas of the frontend, but I still get a lot of these errors. Does anyone know what it could be?
I enabled cache on the search service and increased the batch return to 300. Can this change cause this problem?
o
Do you have the actual error occurring on GMS that is responding with a 500?
From this it looks like GMS is unable to connect to your database.
g
I understand. I'll try to update the cluster's secrets and see if that solves the problem.
But wait... The restore-indices job runs successfully, and it uses the same access parameters as GMS.
Even while the indices are being restored, I constantly get this type of error:
o
A 500 error coming back from GMS will always have an underlying application error behind it (barring 50X-series errors that are specifically at the connectivity layer, like gateway timeouts). There should be an error logged on GMS each time you get one. Verify which error it is, and that should tell you what's going on. If you're consistently getting the connection-down error, something is wrong with GMS's connection to your database. It doesn't make a lot of sense, though, if restore indices is working, as they use the exact same configuration classes.
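One way to check which application error sits behind the 500s is to tally the exception class names in the GMS logs. The following is only a rough sketch: the helper name `count_exceptions` is hypothetical, the regex assumes the logs contain fully qualified Java exception names as they appear in stack traces, and the sample lines are made up for illustration, not taken from this thread.

```python
import re
from collections import Counter

# Matches fully qualified Java class names that end in "Exception" or "Error",
# e.g. java.sql.SQLTransientConnectionException (assumed log format).
EXCEPTION_RE = re.compile(
    r"\b(?:[a-z_][a-z0-9_]*\.)+[A-Z]\w*(?:Exception|Error)\b"
)


def count_exceptions(log_text: str) -> Counter:
    """Count how often each Java exception class name appears in log_text."""
    return Counter(EXCEPTION_RE.findall(log_text))


# Illustrative input only; real input would be the text of the GMS pod logs.
sample = (
    "ERROR request failed\n"
    "Caused by: java.sql.SQLTransientConnectionException: connection unavailable\n"
    "Caused by: com.mysql.cj.jdbc.exceptions.CommunicationsException: link failure\n"
    "Caused by: java.sql.SQLTransientConnectionException: connection unavailable\n"
)

# Print the most frequent exception classes first.
for name, count in count_exceptions(sample).most_common():
    print(f"{count:4d}  {name}")
```

To use it on a real deployment, the log text could be saved with `kubectl logs <gms-pod-name> > gms.log` (the pod name is a placeholder) and read into `count_exceptions`. With 4 GMS replicas it is worth repeating this for each pod, since only some of them may be failing.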
g
I'll check whether it could be some connection/session configuration in the Cloud SQL instance that hosts the MySQL database. Thanks a lot, @orange-night-91387. This has already given me a starting point for solving the problem.