Slackbot
04/05/2023, 9:09 AMElijah Ben Izzy
04/05/2023, 3:04 PMmean_age
node/how its overridden).
So, Hamilton isn’t a feature store but it can be used to feed data into one/extract data out of one. You just have to be a little careful about this so you get the right data in.
The offline/online connection is “backfilling”, and its a naturally tricky problem. Hamilton won’t solve it for you, although there are common techniques of updating a “high watermark” — e.g. a time stamp that says when the latest offline/online data is. Then when you run your offline job every day, you need to update your online job as well. If your streaming job just stops, you’ll probably want to run it so it keeps up (kafka has tooling to do this naturally). Another common approach is to consolidate offline/online at some time (use offline at the end to sweep over then merge with your online stuff to generate it).Elijah Ben Izzy
04/05/2023, 3:07 PMStefan Krawczyk
04/05/2023, 5:04 PMStefan Krawczyk
04/05/2023, 11:15 PM