Hi everyone, I have ingested parquet files into DH...
# ingestion
l
Hi everyone, I have ingested parquet files into DH from GCS. I is there a cleaner way of ingesting lineage data for parquet files that have been partitioned than my example in the comments?
Copy code
lineage:
  - entity:
      name: filepath.artifacts.inventory.parquet.dc000000000000.parquet
      type: dataset
      env: PROD
      platform: gcs
    upstream:
      - entity:
          name: analytics_processed.inventory
          type: dataset
          env: PROD
          platform: bigquery
  - entity:
      name: filepath.artifacts.inventory.parquet.dc0000000000001parquet
      type: dataset
      env: PROD
      platform: gcs
    upstream:
      - entity:
          name: analytics_processed.inventory
          type: dataset
          env: PROD
          platform: bigquery
g
Hi @late-addition-48515 As per doc this source doesn't support lineage https://datahubproject.io/docs/generated/ingestion/sources/gcs
@dazzling-judge-80093 Could you please take a look
l
@helpful-carpet-81510 I know - I am importing the lineage https://datahubproject.io/docs/generated/ingestion/sources/file-based-lineage