https://datahubproject.io logo
#random
Title
# random
s

silly-summer-53814

06/16/2022, 1:19 AM
Hi everyone, can anyone show me how datahub get the min, max, mean, median, distinct value of source like hive, mysql, oracle, postgres, db2
s

silly-summer-53814

06/16/2022, 1:25 AM
many thanks
I need some document, as I am not familar with python
so I suggest does we have some doc for all the source
b

better-orange-49102

06/16/2022, 1:27 AM
https://datahubproject.io/docs/generated/ingestion/sources/hive if u just want the config to enable profiling, can check the respective source

https://www.youtube.com/watch?v=d4S7RgWUg5U

thank you 1
s

silly-summer-53814

06/16/2022, 1:36 AM
many thanks
After finish reading your link and video, I have a better understand of it. Many thanks.
@better-orange-49102
b

big-carpet-38439

06/16/2022, 4:02 PM
Wonderful! Hoping you can simply enable "profiling" when using the Hive ingestion source
s

straight-refrigerator-31859

06/21/2022, 6:00 PM
It is pretty cool ! I am new on datahub but trying to enable profiling on hive got the following issue: org.apache.hadoop.hive.ql.parse.SemanticException:Table not found ge_temp_0ce5542c. Any idea about the root cause? I don’t see any issues on permissions though, thank you!!
2 Views