https://datahubproject.io logo
#getting-started
Title
# getting-started
b

busy-analyst-8258

06/14/2022, 8:04 PM
Hello All, There discrepancy causing counts to differ for tables and DBs on MDH home page between datasets and platforms. how to identify what cause the difference ?
b

bulky-soccer-26729

06/14/2022, 8:08 PM
hi Geetha! could you post a little more info about this issue here? on datahub's home page you're seeing numbers not add up between things that should? screenshots are also welcome
b

busy-analyst-8258

06/14/2022, 8:27 PM
Hello Chris Collins, yes in the homepage
under Explore your Metadata Dataset as 48.2K under Platforms oracle as 24.8K,Greenplum as 24.6k Sqlserver as 1.1k so the dataset entities count and platform counts are not matching
b

bulky-soccer-26729

06/14/2022, 8:33 PM
ahh interesting. We've actually heard this before quite recently as well. Would you mind opening up a github issue here for this so we can ensure we have time to explore where these discrepancies are coming from? just know it's on our radar! I don't want to post any guesses I might have without digging further.
b

busy-analyst-8258

06/14/2022, 8:46 PM
thanks for your reponse Chris Collins, Sure will open the github issue.
b

bulky-soccer-26729

06/14/2022, 8:54 PM
thank you!
b

busy-analyst-8258

06/14/2022, 9:02 PM
created the github issue A short description of the bug #5172
b

bulky-soccer-26729

06/14/2022, 9:02 PM
amazing
b

big-carpet-38439

06/15/2022, 12:11 AM
Most likely it’s because of Containers (databases, schemas) which are not accounted for in Datasets.
I don’t think this is a bug
b

busy-analyst-8258

06/15/2022, 12:55 PM
Hello John Joyce, thanks for the input , can you please elaborate in details . Thank you.
Hello John Joyce,
as per your suggestion i have checked the counts for the Dataset and Platforms and got the difference, the difference is matching with the container counts , so the analysis is correct?
Prod Dataset Count Oracle 64376 Greenplum 13260 MSSQL 8490 total 86126
Platform Count Oracle Greenplum MSSQL Dataset 64376 13260 8490 Container 3455 181 336 Total 67831 13441 8826
Platform Count Oracle Greenplum MSSQL Dataset 64376 13260 8490 Container 3455 181 336 Total 67831 13441 8826 Grand Total : 90098 Difference: 3972 Container total= 3972
b

bulky-soccer-26729

06/16/2022, 7:19 PM
there we go that makes sense!
thanks so much for looking into this Geetha
b

big-carpet-38439

06/16/2022, 7:21 PM
@busy-analyst-8258 Trying to follow the above - are you saying that the total does not add up or does?
b

busy-analyst-8258

06/16/2022, 7:40 PM
Data set count matches with Platform count if we remove the containers from platform..
k

kind-dawn-17532

06/22/2022, 2:39 PM
@big-carpet-38439 is this an intended behavior? If yes, do we want to clarify this in the UI and docs somewhere? This will create confusion for the users otherwise..