Hi All, Currently the <K8s ingestion cron> chart ...
# all-things-deployment
i
Hi All, Currently the K8s ingestion cron chart component creates a CronJob resource hardcoded to launch the metadata ingestion framework for a given configuration. This is not very flexible in cases where you may want to customize the output of the metadata ingestion framework or launch something else altogether. I found this need myself when trying to enrich the output of the ingestion framework before sending the MCEs to DataHub. In order to do so I generalized the ingestion chart to allow the possibility to define a generic shell command with custom logic through a bash script. If anyone think this is useful let me know and I'll open a PR. cc @gray-shoe-75895 @mammoth-bear-12532
b
Thanks for the details Pedro. Cc @early-lamp-41924
Are you saying that the shell command intercepts metadata prior to it being sent to DH?
m
@incalculable-ocean-74010: have you taken a look at the Transformers support recently added? https://datahubproject.io/docs/metadata-ingestion#transformations
i
My bash script happens to do that but you can run anything.
@mammoth-bear-12532 I think transformers were not ready when I did this work
I'm just asking whether you guys are interested in a more generalized ingestion cron job. I can also submit the PR and you can let me know 🙂
m
Definitely open the PR and we can discuss there! Now that transformers are supported, would prefer that be the primary way to customize ingestion post-processing. But there might be other reasons to customize the ingestion script beyond that as well as you said.