Another question I've been wondering about is this...
# general
a
Another question I've been wondering about is this idea of both realtime and offline tables being queried at once, via the same table name. Does anyone have an interesting use case for when they've used this? I'm trying to wrap my head around one
m
Yes this is a very common pattern. What’s your question on this one?
a
really just trying to imagine where this would be useful. in my cases, our streaming data sources are usually so different from our batched data, I'm wondering why I'd want to query them at the same time
do you have an example of this in practice?
m
By example you mean a config setup? Or just want to know who is running it?
a
latter, just a scenario in which it makes sense
m
I think many of LinkedIn’s use cases follow that pattern. For example, “who viewed my profile” that is powered by Pinot follows that
Real-time ingestion gives you freshness. Offline gives you opportunity to pre-aggregate, correct stream error etc
So you get best of both worlds. Does that make sense?
a
ahhh yeah totally. ty!