Have the following Questions : 1. How to use a dif...
# ingestion
h
Have the following Questions : 1. How to use a different Database for GMS service, any documentation that we can follow ? 2. To run Datahub, both Graph Index and Search are required ? What are used for Graph Index and Search Index by default ?Referring to the Architecture diagram shared
m
@handsome-football-66174: by default we use Elastic for both Search and Graph index currently.
h
@mammoth-bear-12532 - What is the role of neo4j ? Also any direction on the first question is greatly appreciated. ie 1. How to use a different Database for GMS service, any documentation that we can follow ?
m
You can use neo4j for graph index as well if you want. multi-hop queries are more efficient in neo4j ... however the UI currently doesn't use them, so there isn't a strong reason for using it, unless you want to query the metadata programmatically
@handsome-football-66174 For the GMS service -> database question: there are community members using MySQL / Postgres / MariaDB / AWS RDS etc. Any thing that works with EBean works. A quick skim over the docs doesn't turn up any results for how to configure different databases though. Which database were you planning to use?
h
Planning to use Postgres DB
m
@early-lamp-41924: what are the steps for connecting datahub-gms to postgres?
e
Take a look at the docker env here https://github.com/linkedin/datahub/blob/master/docker/datahub-gms/env/docker.postgres.env#L3. You need to set EBEAN_DATASOURCE_URL to point to postgres instead of mysql and change the EBEAN_DATASOURCE_DRIVER
h
@early-lamp-41924 - If we were to modify the above is that sufficient to connect to a different DB (not running as a docker ), but a standalone DB.
m
@handsome-football-66174: yes that should work.
👍 1