high-hospital-85984
09/21/2021, 10:00 AMdatahub ingest list-runs
and got presented with something unexpected. Most IDs are random GUIDs, and one suspicioulsy large run wrt row count is called no-run-id-provided
. We primarily use the kafka sink, is there a way of providing some human-readable name to the runs for easier rollback?mammoth-bear-12532
no-run-id-provided
rows were ingested before the ingestion framework had the ability to add run ids on ingestion. I have been thinking about this readability issue as well. Today, run ids can be specified in the ingestion yaml as part of config Eg run_id: looker
will attach static run_id to each run.mammoth-bear-12532
datahub ingest show —run-id RUN_ID
will provide you a summary of each run with sample rows ingestedhigh-hospital-85984
09/21/2021, 2:57 PMmammoth-bear-12532
high-hospital-85984
09/21/2021, 4:01 PMexport RUN_ID_SUFFIX=$(date +%s)
before the run and have run_id: looker_${RUN_ID_SUFFIX}
in the config. Works for us as we anyway run some preparation scripts before the actual ingestionmammoth-bear-12532
loud-island-88694
high-hospital-85984
09/21/2021, 7:31 PMmammoth-bear-12532