Hello, I found that the dataHub version installed ...
# ingestion
m
Hello, I found that the DataHub version installed with pip was 0.8.24. The metadata I retrieved with it differed from what I got from the same database after downloading the source code and building ingestion myself. How did this happen? The metadata retrieved with the pip installation is missing information. The first image shows the metadata obtained with the pip-installed version, and the second and third images show the metadata obtained with the 0.8.26 version I built myself.
s
You may have installed the pip package earlier and not run an upgrade since then. Try running:
python3 -m pip install --upgrade acryl-datahub
Also, try running these commands one by one:
python -c "import sys; print(sys.version); print(sys.executable); import datahub; print(datahub.__file__); print(datahub.__version__);"
python3 -c "import sys; print(sys.version); print(sys.executable); import datahub; print(datahub.__file__); print(datahub.__version__);"
It is common for people to end up with different datahub versions because they have multiple Python installations, for example one behind python and another behind python3. That is one possibility you can check with the commands above.
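The check described above can also be scripted. The following is a minimal sketch, not an official DataHub tool: it assumes the interpreters of interest are reachable on PATH as python and python3, that each is Python 3.8 or newer (for importlib.metadata), and that the package was installed under the name acryl-datahub as in the upgrade command. The function name datahub_versions is hypothetical.

```python
import shutil
import subprocess

# Small probe executed inside each interpreter: prints the interpreter path
# and the installed acryl-datahub version, separated by "|".
PROBE = (
    "import sys, importlib.metadata as m\n"
    "try:\n"
    "    v = m.version('acryl-datahub')\n"
    "except m.PackageNotFoundError:\n"
    "    v = 'not installed'\n"
    "print(sys.executable, v, sep='|')"
)


def datahub_versions(commands=("python", "python3")):
    """Map each command name to 'executable|version', or None if not on PATH."""
    results = {}
    for cmd in commands:
        path = shutil.which(cmd)
        if path is None:
            results[cmd] = None
            continue
        proc = subprocess.run([path, "-c", PROBE], capture_output=True, text=True)
        # If the probe fails (e.g. a very old interpreter), report stderr instead.
        results[cmd] = proc.stdout.strip() or proc.stderr.strip()
    return results


if __name__ == "__main__":
    for cmd, info in datahub_versions().items():
        print(cmd, "->", info or "not on PATH")
```

If the two commands report different executables or different versions, the pip upgrade most likely went into a different installation than the one running the ingestion.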
d
We recently introduced domains and containers, and containers are created automatically for these datasets (https://medium.com/datahub-project/just-shipped-ui-based-ingestion-data-domains-containers-and-more-f1b1c90ed3a). That is why you see a difference between the old and the new version. If you install the latest version from pip, you should see the same result.
m
I upgraded DataHub to version 0.8.26, but the metadata exported to file and to REST did not include the DatasetProperties aspect. The last image shows the result of ingesting over REST with the 0.8.26 metadata ingestion that I built locally.
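One way to compare the two outputs without relying on screenshots is to scan each exported file for the aspect. This is a rough sketch under assumptions: it assumes the file export is JSON and simply counts keys or string values that mention "datasetproperties" (case-insensitive), without relying on the exact event schema. The function name count_mentions and the command-line usage are hypothetical.

```python
import json
import sys


def count_mentions(node, needle="datasetproperties"):
    """Count dict keys and string values that contain `needle`, ignoring case."""
    needle = needle.lower()
    if isinstance(node, dict):
        return sum(
            (needle in str(key).lower()) + count_mentions(value, needle)
            for key, value in node.items()
        )
    if isinstance(node, list):
        return sum(count_mentions(item, needle) for item in node)
    if isinstance(node, str):
        return int(needle in node.lower())
    return 0


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (path is whatever file the export wrote): python check_aspect.py <exported.json>
    with open(sys.argv[1], encoding="utf-8") as handle:
        print(count_mentions(json.load(handle)))
```

Running it on the file produced by each installation gives a count per version. A count of zero for one of them would confirm that the aspect is absent from that export rather than just hidden in the UI.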
r
Hey there! 👋 Make sure your message includes the following information if relevant, so we can help more effectively! 1. Are you using UI or CLI for ingestion? 2. Which DataHub version are you using? (e.g. 0.12.0) 3. What data source(s) are you integrating with DataHub? (e.g. BigQuery)
d
0.8 is quite an old version. Can you try a more recent one?