Hi everyone, first of all, thank you all for devel...
# ingestion
h
Hi everyone, first of all, thank you all for developing such a great tool. I have been trying to ingest a simple recipe using the datahub's ui but I want this recipe to ingest properties as well. I get a success message on ingest but no properties get added to my ingested files. I will post my recipe on the thread. I've seen there is a post already suggesting to start with the mce files from the examples but I am talking specifically on how to achieve the ingestion via the ui in accordance with the documentation here: https://datahubproject.io/docs/metadata-ingestion/docs/transformer/dataset_transformer#simple-add-dataset-datasetproperties
{ transformers: [ { type: "simple_add_dataset_properties", config: { properties: { prop1: "pikachu" }, }, }, { type: "simple_add_dataset_tags", config: { tag_urns: [ "urnlitag:Test Tag", ], }, }, { type: "simple_add_dataset_terms", config: { term_urns: [ "urnliglossaryTerm:Test Glossary Term", ], }, }, { type: "simple_add_dataset_domain", config: { semantics: "OVERWRITE", domains: ["urnlidomain:Test Domain"], }, }, ], sink: { type: "datahub-rest", config: { server: "http://datahub-gms:8080" }, }, source: { type: "s3", config: { // intentionally removed this before posting to slack }, }, };
If I modify the transform, again following the guide posted above, to something like this:
Copy code
{
  type: "simple_add_dataset_properties",
  config: {
    replace_existing: true,
    properties: { ingestedDate: "2022-10-06T07:45:37.882Z" },
  },
},
I get an error stating that no extra fields are permitted, complaining specifically about the
replace_existing
field which is part of the documentation for
simple_add_dataset_properties
, which seems very suspicious of a bug or outdated documentation.
nevermind... I just found out what the problem is... which is weird... I had to move that transform to the bottom of the transformers list to make it work
d
I’m glad it works now