shy-parrot-64120
03/10/2022, 6:47 PM
yaml-anchors? like this:
version: 1
lineage:
  - entity: &dataset
      name: report.payment_reconciliation
      type: dataset
      platform: postgres
      platform_instance: mvp
    upstream:
      - entity: &datajob
          name: report.load_payment_reconciliation
          type: datajob
          platform: postgres
          platform_instance: mvp
  - entity:
      <<: *datajob
      name: report.load_payment_reconciliation
    upstream:
      - entity:
          <<: *dataset
          name: core.payment
      - entity:
          <<: *dataset
          name: core.ph2_transaction
      - entity:
          <<: *dataset
          name: core.ph2_order
AFAIK the answer is no.
Do you have any plans to support something like this?
lemon-hydrogen-83671
03/10/2022, 6:49 PM
shy-parrot-64120
03/10/2022, 6:49 PM
source:
  type: plpgsql
  config:
    database: mvp
    dbms: postgres
    pipelines:
      - group: payments
        entities:
          - name: report.load_payment_reconciliation
            sources:
              - core.payment
              - core.ph2_transaction
              - core.ph2_order
            target: report.payment_reconciliation
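A rough sketch of how the compact `pipelines` layout above could be translated into the lineage-file layout from the first message. Nothing here is an existing DataHub source: `pipelines_to_lineage` is a hypothetical helper, and mapping `dbms` to `platform` and `database` to `platform_instance` is an assumption based only on the values matching in the two snippets. The job name is dropped and datasets are linked directly.

```python
def dataset_entry(name, platform, platform_instance):
    """Build one lineage-file style entry for a dataset (hypothetical helper)."""
    return {
        "entity": {
            "name": name,
            "type": "dataset",
            "platform": platform,
            "platform_instance": platform_instance,
        }
    }


def pipelines_to_lineage(config):
    """Turn the proposed 'pipelines' config into a lineage-file style dict."""
    platform = config["dbms"]               # assumption: dbms maps to platform
    instance = config["database"]           # assumption: database maps to platform_instance
    lineage = []
    for pipeline in config["pipelines"]:
        for job in pipeline["entities"]:
            # The target dataset is the downstream; every source is an upstream.
            entry = dataset_entry(job["target"], platform, instance)
            entry["upstream"] = [
                dataset_entry(src, platform, instance) for src in job["sources"]
            ]
            lineage.append(entry)
    return {"version": 1, "lineage": lineage}


config = {
    "database": "mvp",
    "dbms": "postgres",
    "pipelines": [
        {
            "group": "payments",
            "entities": [
                {
                    "name": "report.load_payment_reconciliation",
                    "sources": ["core.payment", "core.ph2_transaction", "core.ph2_order"],
                    "target": "report.payment_reconciliation",
                }
            ],
        }
    ],
}
lineage_doc = pipelines_to_lineage(config)
```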
lemon-hydrogen-83671
03/10/2022, 6:49 PM
shy-parrot-64120
03/10/2022, 6:50 PM
lemon-hydrogen-83671
03/10/2022, 6:51 PM
lemon-hydrogen-83671
03/10/2022, 6:51 PM
shy-parrot-64120
03/10/2022, 6:53 PM
datahub % yq 'explode(.)' plpgsql_data.yml
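For context, the `yq 'explode(.)'` command above expands anchors, aliases and `<<` merge keys into plain YAML. A minimal Python sketch of the merge-key semantics on already-parsed data follows. `explode_merge` is a hypothetical helper, not part of yq or DataHub, and it assumes each `<<` value is a single mapping rather than a list of mappings.

```python
def explode_merge(node):
    """Recursively expand '<<' merge keys in parsed YAML-like data."""
    if isinstance(node, list):
        return [explode_merge(item) for item in node]
    if not isinstance(node, dict):
        return node
    merged = {}
    # Keys pulled in through '<<' act as defaults...
    if "<<" in node:
        merged.update(explode_merge(node["<<"]))
    # ...and keys written explicitly on the mapping override them.
    for key, value in node.items():
        if key != "<<":
            merged[key] = explode_merge(value)
    return merged


# Stand-in for what '&dataset' anchors in the lineage file.
dataset = {
    "name": "report.payment_reconciliation",
    "type": "dataset",
    "platform": "postgres",
    "platform_instance": "mvp",
}
# Stand-in for '- entity: {<<: *dataset, name: core.payment}'.
upstream = {"entity": {"<<": dataset, "name": "core.payment"}}
exploded = explode_merge(upstream)
print(exploded)
```

The explicit `name` wins over the anchored one, while `type`, `platform` and `platform_instance` are inherited.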
shy-parrot-64120
03/10/2022, 6:53 PM
lemon-hydrogen-83671
03/10/2022, 6:55 PM
shy-parrot-64120
03/10/2022, 7:13 PM
shy-parrot-64120
03/10/2022, 7:14 PM
shy-parrot-64120
03/10/2022, 7:14 PM
ConfigurationError: Type must be one of ['dataset'], datajob is not yet supported.
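The error above suggests the lineage file source only accepts `dataset` entities. A small pre-check along these lines could flag offending entries before running the ingest. This is only a sketch: `unsupported_entities` is a hypothetical helper, not DataHub's own validation, and it assumes the file has already been parsed with anchors and merge keys expanded.

```python
SUPPORTED_TYPES = {"dataset"}  # taken from the error message, not from DataHub's code


def unsupported_entities(lineage_doc):
    """Return (name, type) pairs for entities whose type is not supported."""
    found = []

    def visit(entries):
        for entry in entries:
            entity = entry["entity"]
            if entity["type"] not in SUPPORTED_TYPES:
                found.append((entity["name"], entity["type"]))
            # Upstream entries have the same shape, so recurse into them.
            visit(entry.get("upstream", []))

    visit(lineage_doc["lineage"])
    return found


doc = {
    "version": 1,
    "lineage": [
        {
            "entity": {"name": "report.payment_reconciliation", "type": "dataset"},
            "upstream": [
                {"entity": {"name": "report.load_payment_reconciliation", "type": "datajob"}}
            ],
        }
    ],
}
problems = unsupported_entities(doc)
print(problems)
```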
shy-parrot-64120
03/10/2022, 7:17 PM
---
version: 1
lineage:
  - entity: &dataset
      name: report.payment_reconciliation
      type: dataset
      platform: postgres
      platform_instance: mvp
    upstream:
      - entity:
          <<: *dataset
          name: core.payment
      - entity:
          <<: *dataset
          name: core.ph2_transaction
      - entity:
          <<: *dataset
          name: core.ph2_order
dmytro.kulyk@MB-DAT-564087 datahub % datahub ingest -c plpgsql.yml
[2022-03-10 21:16:00,846] INFO {datahub.cli.ingest_cli:70} - DataHub CLI version: 0.8.28.0
[2022-03-10 21:16:01,241] INFO {datahub.cli.ingest_cli:86} - Starting metadata ingestion
[2022-03-10 21:16:01,246] INFO {datahub.ingestion.source.metadata.lineage:175} - preserve_upstream is set to True
[2022-03-10 21:16:01,246] INFO {datahub.ingestion.source.metadata.lineage:122} - Upstream detected for env='PROD' name='report.payment_reconciliation' type='dataset' platform='postgres' platform_instance='mvp'. Extracting urn...
[2022-03-10 21:16:01,911] INFO {datahub.ingestion.run.pipeline:85} - sink wrote workunit lineage-urn:li:dataset:(urn:li:dataPlatform:postgres,mvp.report.payment_reconciliation,PROD)
[2022-03-10 21:16:01,911] INFO {datahub.cli.ingest_cli:88} - Finished metadata ingestion
Source (datahub-lineage-file) report:
{'workunits_produced': 1,
'workunit_ids': ['lineage-urn:li:dataset:(urn:li:dataPlatform:postgres,mvp.report.payment_reconciliation,PROD)'],
'warnings': {},
'failures': {},
'cli_version': '0.8.28.0'}
Sink (datahub-rest) report:
{'records_written': 1,
'warnings': [],
'failures': [],
'downstream_start_time': datetime.datetime(2022, 3, 10, 21, 16, 1, 475220),
'downstream_end_time': datetime.datetime(2022, 3, 10, 21, 16, 1, 911311),
'downstream_total_latency_in_seconds': 0.436091,
'gms_version': 'v0.8.27'}
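The workunit id in the log shows how the entity fields end up in the dataset URN: the platform instance is prefixed to the dataset name and the environment defaults to `PROD`. A minimal sketch of that pattern, inferred only from the log output above; `dataset_urn` is a hypothetical function, not DataHub's own URN builder.

```python
def dataset_urn(platform, name, env="PROD", platform_instance=None):
    """Compose a dataset URN in the shape seen in the ingestion log."""
    # With a platform instance, the name is qualified as '<instance>.<name>'.
    qualified = f"{platform_instance}.{name}" if platform_instance else name
    return f"urn:li:dataset:(urn:li:dataPlatform:{platform},{qualified},{env})"


urn = dataset_urn(
    platform="postgres",
    name="report.payment_reconciliation",
    platform_instance="mvp",
)
print(urn)
```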
shy-parrot-64120
03/10/2022, 7:25 PM
shy-parrot-64120
03/10/2022, 7:30 PM
lemon-hydrogen-83671
03/10/2022, 7:55 PM
EntityConfig
https://datahubproject.io/docs/metadata-ingestion/source_docs/file_lineage/
shy-parrot-64120
03/10/2022, 8:54 PM
anchoring