Slackbot (02/21/2023, 5:17 PM)

Chaoyu (02/21/2023, 11:15 PM):
`--reload` is designed and built for development purposes. We do not recommend using it for serving production traffic, and it is not an ideal solution for hot-loading models.

Chaoyu (02/21/2023, 11:16 PM)

Chaoyu (02/21/2023, 11:17 PM)

Jori Geysen (02/22/2023, 4:04 PM):
`--reload`.
I think I found a decent workaround though:
• I still have a prediction endpoint, but slightly updated it. It now updates the runner to reflect the changes in the `latest` file in the `/home/bentoml/bento/models/model_name/` directory:
```python
import bentoml
from bentoml.io import JSON

# `DataPoint` is the pydantic request schema, defined elsewhere in the service.
runner = bentoml.transformers.get("model_name:latest").to_runner()
svc = bentoml.Service("model_name", runners=[runner])

@svc.api(
    input=JSON(pydantic_model=DataPoint), output=JSON(), route="api/v1/ops/predict"
)
async def predict(data_point: DataPoint) -> list:
    # Swap the runner's model for whatever "latest" resolves to right now.
    runner.models.clear()
    latest_model = bentoml.models.get("model_name:latest")
    runner.models.append(latest_model)
    return await runner.async_run(data_point.dict(), truncation=True)
```
• I still have another endpoint which updates the model and the `latest` file in the `/home/bentoml/bento/models/model_name/` directory:
  ◦ Download a model into the `/home/bentoml/downloaded_models` directory in the container.
  ◦ Call the `bentoml.transformers.save_model` method with a `transformer_pipeline` pointing to the `/home/bentoml/downloaded_models` directory, which contains the downloaded model. This saves the newly downloaded model as a new directory, `/home/bentoml/bento/models/model_name/YYYY`, and updates `latest` to point to `YYYY`.
This way, the runner is always running the `latest` model.
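For illustration, the version-directory-plus-`latest`-pointer layout described above can be mimicked with plain stdlib code. This is a minimal sketch of the mechanic, not BentoML's actual model store implementation; the `save_version` helper and directory names here are hypothetical:

```python
import os
import tempfile
from datetime import datetime, timezone

def save_version(store_root: str, model_name: str, payload: bytes) -> str:
    """Save `payload` as a new version directory and repoint `latest`.

    Mimics the layout described above: each save creates
    <store_root>/<model_name>/<version>/ and rewrites a `latest` marker
    so that readers of "model_name:latest" pick up the new version.
    """
    version = datetime.now(timezone.utc).strftime("%Y%m%d%H%M%S%f")
    model_dir = os.path.join(store_root, model_name, version)
    os.makedirs(model_dir)
    with open(os.path.join(model_dir, "model.bin"), "wb") as f:
        f.write(payload)
    # Update the `latest` marker via rename, so readers never see a
    # half-written pointer.
    latest_path = os.path.join(store_root, model_name, "latest")
    tmp_path = latest_path + ".tmp"
    with open(tmp_path, "w") as f:
        f.write(version)
    os.replace(tmp_path, latest_path)
    return version

# Demo in a temporary store: two saves, `latest` tracks the newest one.
root = tempfile.mkdtemp()
v1 = save_version(root, "model_name", b"weights-v1")
v2 = save_version(root, "model_name", b"weights-v2")
with open(os.path.join(root, "model_name", "latest")) as f:
    current = f.read()
```

The rename-based pointer update is the reason the prediction endpoint can safely resolve `model_name:latest` on every request while another endpoint is writing a new version.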
Same questions here: are you seeing any red flags related to scaling of the runners, or other concerns? Thanks again in advance 🙂

Jori Geysen (02/22/2023, 4:10 PM)