Slackbot
03/24/2023, 10:20 PMMike Kuhlen
03/25/2023, 9:24 PMMike Kuhlen
03/25/2023, 9:25 PMMike Kuhlen
03/25/2023, 9:26 PMMike Kuhlen
03/25/2023, 9:26 PMEric Riddoch
03/27/2023, 8:51 PMEric Riddoch
03/27/2023, 8:52 PMMike Kuhlen
03/27/2023, 9:33 PMOr are you hosting each bento individually, and using REST to send the dataframe from the root to the others?Yes, that’s what we do.
Mike Kuhlen
03/27/2023, 9:34 PMDo you think this would be both cost-effective and stable at low-volume? Like, one advantage I can see of using BentoML’s directly in your DAG orchestrator is that you are only paying to run your Bento when it’s actually needed. Once the data volume gets high (frequent) I imagine you’d always want a bento running.Yes, that’s a good point. We’re in a place where just keeping them running is ok. And we do use the auto-scaling, so when we’re not making heavy use of the bentos, they’re down to 2 pods and pretty low cpu/mem requests (the limits are higher).
Eric Riddoch
03/27/2023, 11:57 PMsauyon
03/31/2023, 7:41 AMserver = bentoml.HTTPServer("my_bento:version")
with server.start() as client:
client.classify(...)