Hi all, I am planning to deploy the data hub with ...
# getting-started
c
Hi all, I am planning to deploy the data hub with integrates to HUDI and delta on Azure ADLS Gen2 are there any quick start documentation which can help me for the same?
m
Hi @calm-student-43079 are you looking for integration guides to connect Datahub with hudi and delta lake. Or looking to deploy DataHub on azure?
c
Hi @mammoth-bear-12532 I m looking for deploying data hub on aks (k8s) and then connect to hudi and delta lake which resides on adls gen 2
m
We don’t have specific guides for deploying DataHub on Azure. But you should be able to get started with the k8s artifacts that we publish (like the helm charts)
Wrt hudi : the hudi project hosts the docs for the integration: https://hudi.apache.org/docs/syncing_datahub/
c
Thank you for the response
m
For delta-lake integration, we don’t have adls support yet (the current connector works with s3)
But there is quite a bit of interest in that, so we will probably work on it shortly
You can get around the limitation if you connect to hive directly (if you have a metastore on top of delta lake)
c
So even if I have hudi in adls gen2 I can if I have hive meta store
And the same can work with delta too
m
Yes, the integration with hudi is a push based integration which means you get metadata pushed to Datahub whenever it changes which is nice
Hive integration is pull-based, which means you are extracting metadata in batch manner which makes it less fresh
So you can mix and match : use hudi native integration for the hudi case and hive on delta for delta tables on adls
Or wait a month for delta on adls to be available natively
c
Is there any active PR implementing the adls gen2 support ?
Would love to contribute if possible
m
That would be great
There isn’t an active PR yet
There is an iceberg source that works on top of adls today
So it would be ideal to refactor that out and use it across iceberg and delta sources.
c
Perfect thanks I would love to contribute will go through process of sending PR
b
@calm-student-43079: Did you worked on this any further? adls gen2 support is also very interesting for us.
c
Sorry for the late reply 😞 I am planning to work on it soon
c
Hi, there! Let me know if you guys need more help on this, I have interest in adls gen2 support too.