Hello everyone, do we have a guide for deploying DataHub in a production environment? Any production-ready guidelines and best practices?
big-carpet-38439
06/06/2022, 5:09 PM
Hi @adorable-addition-82413!
Specs will differ depending on your company's size (number of datasets, users, etc.), but a basic setup on AWS usually works fine for most mid-to-large companies.
Typically, people follow this AWS guide: https://datahubproject.io/docs/deploy/aws
We use three EC2 m3.large instances for a Kubernetes (EKS) cluster on AWS.
In production, most folks deploy using Helm: https://datahubproject.io/docs/deploy/kubernetes
adorable-addition-82413
06/07/2022, 12:04 PM
Thank you. I'll be using GCP instead of AWS, but I'll try to follow these docs.
big-carpet-38439
06/07/2022, 2:37 PM
Wonderful! Since we don't have a GCP doc, it would be AMAZING if you'd be willing to document your steps. Maybe we can turn it into a GCP guide!
adorable-addition-82413
06/10/2022, 2:36 PM
Hey @big-carpet-38439. How are you doing?
I was wondering when I should use Elasticsearch as a service; on GCP we have the Elasticsearch cloud offering. Is it OK to manage Elasticsearch myself on my cluster, following the quickstart, for production use?