just wondering, how best i can use Datahub for (meta)data versioning. something similar to https://github.com/iterative/dvc for datasets versioning
i
incalculable-ocean-74010
02/08/2022, 5:31 PM
Hello @important-machine-62199
What do you expect to with metadata versioning? Datahub already keeps that information stored in it’s databases.
I believe @mammoth-bear-12532 is working on this area, working on a history api.
i
important-machine-62199
02/09/2022, 4:13 AM
a way of track & present, 1) correlation(results vs. schema/layout's of dataset, datajob, data-pipeline's) 2) presentation of '1' as visual representation rather present text based ones.
I believe, presently datahub is storing schema diffs(as dataset schema changes), is there a way/api to get that info.
i
incalculable-ocean-74010
02/09/2022, 2:40 PM
Right now there isn’t an API to get schema diffs. We are actively working on it right now though! If you can’t wait an alternative would be to query the database directly.
m
mammoth-bear-12532
02/09/2022, 4:54 PM
Hi @important-machine-62199, this (API + cli support for metadata versioning) would be available in next month's release.