I setup the k8s datahub using helm <https://github...
# getting-started
b
I set up DataHub on k8s using Helm: https://github.com/acryldata/datahub-helm. The service runs successfully in my cluster and the pods are healthy. How can I connect to my Hive tables? Where can I find documentation on connecting DataHub to Hive, or on ingesting my Hive metadata into DataHub?
l
b
I see that this page covers Docker. But where do I set up recipes in k8s?
l
Ingestion runs separately, and we don't currently support running it within the same k8s cluster. You can run it on a schedule using something like Airflow.
b
We recommend deploying the DataHub service to K8s as you've done, then pointing your ingestion jobs to that deployment (over ingress). The reason these are separate is that one of DataHub's primary use cases is to push metadata from across a complex data ecosystem that spans execution environments.
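To make "pointing your ingestion jobs to that deployment" concrete, here is a minimal sketch of a Hive recipe built in Python. The host names and URLs are placeholders (assumptions, not values from this thread), and `build_hive_recipe` / `run_ingestion` are hypothetical helper names. The recipe shape (a `hive` source with `host_port`, a `datahub-rest` sink with `server`) follows the DataHub recipe format as I understand it; check the field names against the docs for your version.

```python
import json


def build_hive_recipe(hive_host_port: str, datahub_gms_url: str) -> dict:
    """Build a recipe that reads from Hive and pushes to DataHub over REST.

    Both arguments are placeholders you supply: the HiveServer2 address and
    the URL where your DataHub GMS service is exposed (e.g. via ingress).
    """
    return {
        "source": {
            "type": "hive",
            "config": {"host_port": hive_host_port},
        },
        "sink": {
            "type": "datahub-rest",
            "config": {"server": datahub_gms_url},
        },
    }


def run_ingestion(recipe: dict) -> None:
    """Run the recipe in-process.

    Assumes the `acryl-datahub` package with the Hive plugin is installed;
    the import is kept local so the rest of this file works without it.
    """
    from datahub.ingestion.run.pipeline import Pipeline

    pipeline = Pipeline.create(recipe)
    pipeline.run()
    pipeline.raise_from_status()


if __name__ == "__main__":
    # Hypothetical addresses, for illustration only.
    recipe = build_hive_recipe(
        "hive.example.internal:10000",
        "https://datahub-gms.example.com",
    )
    print(json.dumps(recipe, indent=2))
```

The same dictionary can be saved as a YAML or JSON recipe file and passed to `datahub ingest -c <file>` from wherever the job runs (Airflow, a cron job, a laptop).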
b
I see that there is a cron job:
I need to pull metadata from the Hive server.
I need to figure out the configuration for this cron job.
b
Correct - you can use the cron job also 🙂
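For anyone wiring this up by hand, below is a rough sketch of what a scheduled ingestion job could look like as a plain Kubernetes `CronJob`. It is a generic manifest, not the Helm chart's own template. The image name, the ConfigMap name, the `/recipes` mount path, and the `build_ingestion_cronjob` helper are all assumptions made for illustration.

```python
import json


def build_ingestion_cronjob(
    name: str,
    schedule: str,
    recipe_configmap: str,
    recipe_file: str,
    image: str = "acryldata/datahub-ingestion:latest",  # assumed image name
) -> dict:
    """Return a batch/v1 CronJob manifest that runs `datahub ingest` on a schedule.

    The recipe file is expected to live in an existing ConfigMap
    (`recipe_configmap`), which is mounted at /recipes in the container.
    """
    return {
        "apiVersion": "batch/v1",
        "kind": "CronJob",
        "metadata": {"name": name},
        "spec": {
            "schedule": schedule,  # standard cron syntax
            "jobTemplate": {
                "spec": {
                    "template": {
                        "spec": {
                            "restartPolicy": "Never",
                            "containers": [
                                {
                                    "name": "ingest",
                                    "image": image,
                                    "command": [
                                        "datahub",
                                        "ingest",
                                        "-c",
                                        f"/recipes/{recipe_file}",
                                    ],
                                    "volumeMounts": [
                                        {"name": "recipes", "mountPath": "/recipes"}
                                    ],
                                }
                            ],
                            "volumes": [
                                {
                                    "name": "recipes",
                                    "configMap": {"name": recipe_configmap},
                                }
                            ],
                        }
                    }
                }
            },
        },
    }


if __name__ == "__main__":
    # Hypothetical names; prints JSON that `kubectl apply -f -` can consume.
    manifest = build_ingestion_cronjob(
        "hive-ingestion", "0 * * * *", "hive-recipe", "hive_recipe.yml"
    )
    print(json.dumps(manifest, indent=2))
```

If you use the cron job that ships with the Helm chart instead, the schedule and recipe location are set through the chart's values rather than a hand-written manifest.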
b
OK, so I set up the cron job and trigger the ingestion.
Yes, thank you John. That could be the starting point.
g
@lemon-receptionist-90470
We recommend deploying the DataHub service to K8s as you've done, then pointing your ingestion jobs to that deployment (over ingress).