# ask-for-help
a
You can create a setup script `download_weights.py` similar to https://github.com/bentoml/BentoML/blob/main/examples/custom_runner/nltk_pretrained_model/download_nltk_models.py, and then pass it via `docker.setup_script=/path/to/download_weights.py` to include the cache folder in the container.
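For illustration, a minimal sketch of what such a setup script could look like, assuming a Stable Diffusion pipeline from the Hugging Face Hub (the model id is an example, not something from this thread):

```python
#!/usr/bin/env python
# download_weights.py -- illustrative sketch: pre-download the weights at
# image build time so the Hugging Face cache ships inside the container.
from diffusers import StableDiffusionPipeline

if __name__ == "__main__":
    # The model id is an assumption; use whichever model you actually deploy.
    StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
```

And the corresponding bentofile.yaml entry (the path is illustrative):

```yaml
docker:
  setup_script: "./download_weights.py"
```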
Regarding the float16 support, cc @larme (shenyang) for more information, but AFAIK https://github.com/bentoml/BentoML/pull/3823 is extension work to keep up with the rapid changes in diffusers.
s
Thanks! Would it make sense to specify a model path inside a bento.yaml file to include it automatically? Don't the models need to be in the "Bento format", or can they be used just as they are loaded by the Hugging Face library? I could go the classical route of just writing a Dockerfile and loading everything manually, but I wonder if it's more efficient to do it the Bento way.
a
For Hugging Face models we let the library manage its own cache; it doesn't make sense to also save them into the BentoML model store, since the weights would then be stored twice. A Bento can also exclude the model files.
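As a sketch, excluding local weights from the Bento can be done with the `exclude` field in bentofile.yaml; the patterns below are hypothetical:

```yaml
# bentofile.yaml (excerpt) -- keep multi-GB local weights out of the Bento
# and rely on the Hugging Face cache instead.
exclude:
  - "models/"          # hypothetical local weights directory
  - "*.safetensors"    # hypothetical weight-file pattern
```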
s
I see. Would this also be true for transformer LLMs? I want to deploy a solution that already has the model files (Stable Diffusion) inside the Docker container or in some persistent Docker volume, so I would not even use the Hugging Face cache directory, just local model files that can be several GB each. Is there a way to support persistent Docker volumes?
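For reference, a sketch of the setup this question describes, loading a pipeline from local files rather than the Hugging Face cache; the `/models` path and the volume mount are assumptions, not a BentoML convention:

```python
# load_local.py -- illustrative: load Stable Diffusion from a local directory,
# e.g. a persistent Docker volume mounted at /models.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "/models/stable-diffusion",  # weights previously saved via save_pretrained()
    torch_dtype=torch.float16,   # optional, ties into the float16 point above
)
```

The directory would be mounted at run time with something like `docker run -v /host/models:/models ...`.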