cool-architect-34612
04/10/2022, 11:11 PMcool-architect-34612
04/11/2022, 3:40 AMcold-jelly-78088
04/11/2022, 10:39 AMmillions-notebook-72121
04/11/2022, 11:05 AMalert-football-80212
04/11/2022, 12:16 PMnutritious-jackal-99119
04/11/2022, 1:15 PMmodern-monitor-81461
04/11/2022, 7:34 PMExplore
like views as opposed to Superset being strictly a BI tool (but it does offer virtual datasets, a bit like Explores, no?)square-solstice-69079
04/12/2022, 8:18 AMhallowed-analyst-96384
04/12/2022, 6:59 PMtype: datahub-business-glossary
config:
file: business_glossary.yml
sink:
type: datahub-rest
config:
server: "${DATAHUB_REST_HOST}:${DATAHUB_REST_PORT}"
but after exporting DATAHUB_REST_HOST=localhost
and DATAHUB_REST_PORT=8080.
I do ingestion and get this error: UnboundVariable: 'DATAHUB_REST_HOST: unbound variable'.
microscopic-mechanic-13766
04/13/2022, 8:28 AMbrave-forest-5974
04/13/2022, 12:26 PMfamous-match-44342
04/13/2022, 1:06 PMhallowed-analyst-96384
04/13/2022, 4:35 PMERROR - ('Unable to emit metadata to DataHub GMS', {'message': "HTTPConnectionPool(host='localhost', port=80): Max retries exceeded with url: /entities?action=ingest (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f16c9a1c970>: Failed to establish a new connection: [Errno 111] Connection refused'))"})
The Airflow connection is well set:
`airflow.connections`:
- id: datahub_rest_default
type: datahub_rest
host: <http://localhost>
port: 8080
And the Airflow configuration file gets the right information: AIRFLOW__LINEAGE__DATAHUB_KWARGS: '{ "datahub_conn_id": "datahub_rest_default", "capture_ownership_info": true, "capture_tags_info": true }'
I would like to add that the following Airflow was deployed in a Kubernetes environment using :
airflow:
image:
repository: acryldata/airflow-datahub
tag: latest
famous-match-44342
04/14/2022, 4:00 AMgreat-dentist-95905
04/14/2022, 9:12 AMclean-nightfall-92007
04/15/2022, 6:56 AM2022-04-15 05:05:01.500:INFO::main: Logging initialized @420ms to org.eclipse.jetty.util.log.StdErrLog
WARNING: jetty-runner is deprecated.
See Jetty Documentation for startup options
<https://www.eclipse.org/jetty/documentation/>
2022-04-15 05:05:01.539:INFO:oejr.Runner:main: Runner
2022-04-15 05:05:01.736:INFO:oejs.Server:main: jetty-9.4.20.v20190813; built: 2019-08-13T21:28:18.144Z; git: 84700530e645e812b336747464d6fbbf370c9a20; jvm 1.8.0_302-b08
2022-04-15 05:05:03.302:WARN:oejw.WebAppContext:main: Failed startup of context o.e.j.w.WebAppContext@3339ad8e{/,null,UNAVAILABLE}{file:///datahub/datahub-gms/bin/war.war}
java.util.zip.ZipException: invalid entry CRC (expected 0xe78b2198 but got 0xde56297a)
at java.util.zip.ZipInputStream.readEnd(ZipInputStream.java:394)
at java.util.zip.ZipInputStream.read(ZipInputStream.java:196)
at java.util.jar.JarInputStream.read(JarInputStream.java:207)
at org.eclipse.jetty.util.IO.copy(IO.java:172)
at org.eclipse.jetty.util.IO.copy(IO.java:122)
at org.eclipse.jetty.util.resource.JarResource.copyTo(JarResource.java:218)
I found this error when using the official image to build;acceptable-architect-70237
04/15/2022, 3:57 PMDB.Table1
and DB.Table2
conforms Datahub's UpstreamLineage Aspect definition.
DB.Tabe1 -> DB.Table2
Later on, I ingested another upstream lineage for DB.Table3
as such with correct aspect definition
DB.Table2 -> DB.Table3
My question is, In Datahub, will DB.Table3
automatically show the following deriving relationship. Or I query any node (DB.Table1, DB.Table2), it will automatically show the upstream
and downstream
if available?
DB.Table2 -> DB.Table2 -> DB.Table3
I would assume it should show as such. Otherwise, we could just use upstreamlineage
and downstreamlineage
aspects to render the UI, don't need to use the graphdb at all.fierce-city-89572
04/15/2022, 4:09 PMfresh-memory-10355
04/15/2022, 6:16 PMadorable-receptionist-20059
04/15/2022, 9:06 PMgentle-camera-33498
04/18/2022, 7:44 PMalert-football-80212
04/19/2022, 8:40 AMquaint-lighter-81058
04/19/2022, 6:17 PMsquare-solstice-69079
04/20/2022, 9:38 AMmammoth-fountain-32989
04/20/2022, 1:01 PMhallowed-analyst-96384
04/20/2022, 1:26 PMbreezy-noon-83306
04/20/2022, 3:07 PMripe-alarm-85320
04/20/2022, 10:08 PMsalmon-area-51650
04/21/2022, 12:22 PMwooden-chef-22394
04/22/2022, 11:33 AM