Hello there, I am trying to modify superset.py f...
# ingestion
k
Hello there, I am trying to modify superset.py for the ingestion and add more lineage to the existing one. I noticed that if I emit lineage via DatahubRestEmitter, it works completely fine. However, if I emit lineage via self.report.report_workunit, I do not see the workunit being sent to the server. I couldn't figure out why, so I'm asking here. Is there a way to submit lineage work with this workunit system?
s
The report_workunit method is only there for reporting how many workunits were processed successfully (or with errors, warnings, etc.). To send workunits to the server, you have to yield them in the get_workunits method of the source. Strictly speaking, yielding a workunit in get_workunits does not send it to the server by itself: the workunits are passed to the configured sink, which then sends them to the server.
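Here is a rough sketch of that flow. It deliberately uses small stand-in classes (WorkUnit, SourceReport, ListSink, LineageSource, and run_pipeline are all hypothetical names made up for illustration), not the real DataHub classes, so treat it as a picture of the pattern rather than the actual API:

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class WorkUnit:
    """Stand-in for a metadata workunit (hypothetical, not the DataHub class)."""
    id: str
    payload: dict


@dataclass
class SourceReport:
    """Stand-in report: it only keeps counters and never sends anything."""
    workunits_produced: int = 0

    def report_workunit(self, wu: WorkUnit) -> None:
        self.workunits_produced += 1


@dataclass
class ListSink:
    """Stand-in sink: a real sink would send each workunit to the server."""
    received: List[WorkUnit] = field(default_factory=list)

    def write(self, wu: WorkUnit) -> None:
        self.received.append(wu)


class LineageSource:
    """Stand-in source that produces a single lineage workunit."""

    def __init__(self) -> None:
        self.report = SourceReport()

    def get_workunits(self) -> Iterator[WorkUnit]:
        wu = WorkUnit(id="lineage-1", payload={"upstream": "a", "downstream": "b"})
        self.report.report_workunit(wu)  # bookkeeping only, nothing is sent here
        yield wu  # yielding is what hands the workunit to the pipeline


def run_pipeline(source: LineageSource, sink: ListSink) -> None:
    # The pipeline pulls workunits from the source and passes each one to the sink.
    for wu in source.get_workunits():
        sink.write(wu)


source = LineageSource()
sink = ListSink()
run_pipeline(source, sink)
```

In this sketch, calling report_workunit on its own would only bump the counter. The sink receives the workunit only because get_workunits yields it.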
k
Gotcha, thanks. I also figured it out later: the workunit report has nothing to do with actually submitting the work to be processed. The way to get it processed is by yielding it.