@loud-island-88694 we actually have an internal workflow engine (open source here: https://github.com/insitro/redun) that we use for all of our internal analysis pipelines. I want to write something that takes the internal executions and gets the S3 artifacts + metadata into DataHub automatically. From the OpenLineage docs, we originally thought the OpenLineage --> DataHub integration already existed, so we would only need to implement redun --> OpenLineage. Any suggestions? It now seems like the benefit of putting an OpenLineage representation in the middle only comes if DataHub implements its own OpenLineage ingestion, or if we want to reuse the lineage in other tools that consume OpenLineage.
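
For concreteness, here is a rough sketch of what the redun --> OpenLineage half might emit for one execution. It is only an illustration under assumptions: the `execution` dict shape, the `build_run_event` and `s3_dataset` helpers, the job namespace, and the producer URL are all hypothetical, it does not call any real redun or DataHub API, and the `schemaURL` version is a guess that should be checked against the current OpenLineage spec.

```python
import json
import uuid
from datetime import datetime, timezone
from urllib.parse import urlparse

# Assumed values, not taken from redun, OpenLineage, or DataHub docs.
PRODUCER = "https://example.com/redun-openlineage"  # hypothetical producer URI
SCHEMA_URL = (  # spec version here is an assumption
    "https://openlineage.io/spec/1-0-5/OpenLineage.json#/definitions/RunEvent"
)


def s3_dataset(uri: str) -> dict:
    """Split an s3:// URI into a dataset reference (bucket as namespace, key as name)."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    return {"namespace": f"s3://{parsed.netloc}", "name": parsed.path.lstrip("/")}


def build_run_event(execution: dict, job_namespace: str = "redun") -> dict:
    """Turn a hypothetical execution record into an OpenLineage-style COMPLETE event.

    `execution` is an assumed plain dict with keys "id", "job_name",
    "inputs", and "outputs" (the last two are lists of s3:// URIs).
    """
    # OpenLineage run IDs are UUIDs, so derive a stable one from the execution id.
    run_id = str(uuid.uuid5(uuid.NAMESPACE_URL, execution["id"]))
    return {
        "eventType": "COMPLETE",
        "eventTime": datetime.now(timezone.utc).isoformat(),
        "run": {"runId": run_id},
        "job": {"namespace": job_namespace, "name": execution["job_name"]},
        "inputs": [s3_dataset(u) for u in execution["inputs"]],
        "outputs": [s3_dataset(u) for u in execution["outputs"]],
        "producer": PRODUCER,
        "schemaURL": SCHEMA_URL,
    }


if __name__ == "__main__":
    example = {
        "id": "exec-123",
        "job_name": "align_reads",
        "inputs": ["s3://my-bucket/raw/sample1.fastq"],
        "outputs": ["s3://my-bucket/aligned/sample1.bam"],
    }
    print(json.dumps(build_run_event(example), indent=2))
```

If DataHub never grows OpenLineage ingestion, the same dict could be translated directly into whatever DataHub expects, so the intermediate format mostly costs one extra mapping step.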