# events
Sean
Hi Sam, currently models must be loaded back into memory to use the `save_model` API. Does this not work for your use case?
Sam
Hi Sean, the problem here is that we keep the inference script (the script that runs inference on the model) and the model file separately, and we would like to load the model when we start the server. If I could save the model directly, I could access it in the serving script and load it using the model_load function from the inference script.
The inference script we have has four functions: preprocess, model_load, predict and postprocess.
If we have to load the model and then save it, I would have to extract the model_load function from the inference script and then save the model, which is a bit complex, so I'm looking for an alternative. Our current method: `model = inference_script.model_load_function(.pth_file)` followed by `bentoml.pytorch.save_model(model)`.
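A minimal sketch of the workflow described above, assuming a BentoML 1.x-style `bentoml.pytorch.save_model(name, model)` API; the `inference_script` module, its `model_load_function`, and the file/model names are placeholders taken from the message, not a definitive implementation:

```python
import bentoml
import inference_script  # user-provided script with preprocess, model_load, predict, postprocess

# Recreate the in-memory model object via the user's own loader,
# then register it with the BentoML model store.
model = inference_script.model_load_function("model_weights.pth")  # path is illustrative
bento_model = bentoml.pytorch.save_model("my_model", model)
print(bento_model.tag)  # tag of the saved model in the store
```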
Sean
I see. Ideally `save_model` should be called when the `.pth` model file is created. So instead of saving to the `.pth` file, you would get a BentoML model object.
But I understand due to restrictions, this may not be possible.
Sam
Yeah, exactly. We are getting the pretrained models in their respective libraries' save formats, so there is a need to recreate the model object from the inference script users are providing.
Sean
Loading a model back into memory is usually fairly standard. With PyTorch, it should be a few lines of Python code if you do not wish to invoke the inference script.
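As a rough illustration of the "few lines of PyTorch" mentioned here, assuming the `.pth` file holds a state dict and the model class is importable; `MyNet`, the module path, and the names are hypothetical:

```python
import torch
import bentoml
from my_project.models import MyNet  # hypothetical model class

# Rebuild the architecture and load the saved weights back into memory,
# then hand the live module to BentoML instead of the raw .pth file.
model = MyNet()
state_dict = torch.load("model_weights.pth", map_location="cpu")
model.load_state_dict(state_dict)
model.eval()

bentoml.pytorch.save_model("my_model", model)
```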
Sam
Yeah, just looking to see if simpler alternatives exist. Thanks @Sean.
👍 1