Hi again. We have a task: we used to have Datahub ...
# advice-metadata-modeling
e
Hi again. We have a task: we used to have Datahub connected to or main dwh on Hive, and we cave all descriptions, glossary terms tags etc filled on Hive datasource. Now we want to switch Datahub connecntion to Presto, which is connected to the same dwh, but it's much faster. So the issue is copying all metadata (descriptions and other editable properties, owners, domains etc) and transfer it from Hive datasource to Presto datasourse in Datahub,. The rest is 100% identical: all datasets, contaners, enviromments, schemas - it is literally the same dwh, just a different connetor to it, few different letters and a browse path. Any advice how to transfer all metadata from one datasourse to another in Datahub? Is there a maybe a special function for transferring metadata?
πŸ“– 1
πŸ” 1
l
Hey there πŸ‘‹ I'm The DataHub Community Support bot. I'm here to help make sure the community can best support you with your request. Let's double check a few things first: βœ… There's a lot of good information on our docs site: www.datahubproject.io/docs, Have you searched there for a solution? βœ… button βœ… It's not uncommon that someone has run into your exact problem before in the community. Have you searched Slack for similar issues? βœ… button Did you find a solution to your issue? ❌ Sorry you weren't able to find a solution. I'm sending you some tips on info you can provide to help the community troubleshoot. Whenever you feel your issue is solved, please react βœ… to your original message to let us know!
a
This could be accomplished via API or CLI I believe- @gray-shoe-75895 are you familiar with any other methods for doing a platform swap like this?
a
We don’t have anything out of the box for this. We do have a
datahub migrate dataplatform2instance
, which adds data platforms, so you should be able to base your code off of that
e
@gray-shoe-75895 thank you! Although in dataplatform2instance documentation I don't see the list of aspects that will be migrated, it just says "for example documentation or added tags". But there are a lot of different metadata aspects - source descriptions, containers' descriptions, tables description with all the documentation, tags, terms, domains, table fields with their descriptions tags and terms... is there somewhere I can see the list of aspects that's affected by this command?
a
The list is here https://github.com/datahub-project/datahub/blob/98c4e92fd08b5fdf4e4ef45b66159024a1059c99/metadata-ingestion/src/datahub/cli/migration_utils.py#L[…]5. Note that to make this work for your use case, it will probably require a fair bit of custom code
e
The paige is not available to me. Do I need some kind of additional access?
e
thank you! I'll try and see if will be able to customize it for our case