ancient-queen-15575
05/03/2023, 4:21 PMbucket_duration
variable?
An initial run of a snowflake ingestion I’m trying takes about 3 minutes. If I use stateful ingestion and remove the ignore_start_time_lineage: true
line, then a rerun takes about 30s. That seems great but what I understood from the docs is that only lineage changes from the past day will be picked up like this. It would be nice if the past few days were checked incase Datahub went down for a few days.
Is there a way to configure checking, for example, the past 3 days? I see there’s a bucket_duration
variable that’s an enum, but what are the accepted values for it? I can’t see any documentation for that.lively-cat-88289
05/03/2023, 4:21 PMhundreds-photographer-13496
05/04/2023, 10:57 AMstart_time: "2023-04-21T00:00:00Z"
and ingest missing lineage as a one-off activity.ancient-queen-15575
05/04/2023, 11:54 AMhundreds-photographer-13496
05/05/2023, 4:42 AMdoes lineage ingestion only ever check for the last day when using stateful ingestion?thats correct.
hundreds-photographer-13496
08/23/2023, 11:27 AM