Hey all! I'm exploring datahub as a possible proje...
# getting-started
b
Hey all! I'm exploring datahub as a possible project to take on in my new role, and was wondering what data access datahub gets when it's connected to a database? I didn't see this mentioned in the documentation anywhere. Is there no issue because it's hosted locally on your docker image? Thanks for what may be a silly question! Sorry if this a silly question! If you could point me in the right direction that'd be super helpful!
📖 1
🩺 1
🔍 1
2
l
Hey there 👋 I'm The DataHub Community Support bot. I'm here to help make sure the community can best support you with your request. Let's double check a few things first: ✅ There's a lot of good information on our docs site: www.datahubproject.io/docs, Have you searched there for a solution? ✅ button ✅ It's not uncommon that someone has run into your exact problem before in the community. Have you searched Slack for similar issues? ✅ button Did you find a solution to your issue? Yes button No button
l
Hi @best-yacht-69562! There’s a bit of variability between databases, but generally speaking DataHub is extracting metadata from: • System tables (ie.
information_schema
, catalogs, etc.) to determine what datasets are available • Query logs - what queries have been executed & by whom - this informs our lineage graph & overall usage stats • Datasets themselves for high-level data profiling I encourage you to check out our docs for the specific database you’re interested in - you should be able to find detail on what level of access is required & for what purpose https://datahubproject.io/integrations/
b
Omg the integration page had it all 🙈
That's exactly what I needed. Thank you!!