Internally we mainly rely on various systems (incl Gobblin) to emit messages to us directly. There are also crawler scripts here and there for systems that we can't be instrumented, though they're written in Java and runs on Azkaban. The ETL scripts on GitHub mainly served as a "demo" and are not meant to be used in production verbatim.