# ask-for-help
j
not sure if this will work, but try
Copy code
PYTHONPATH=. python project/wf/services/save_model.py
šŸ™ 1
s
ohhhh - this seems to be working, but I get another error:
Copy code
{
  "asctime": "2023-04-11 17:47:53,414",
  "name": "flytekit",
  "levelname": "WARNING",
  "message": "FlyteSchema is deprecated, use Structured Dataset instead."
}
This is just a warning though - not sure why it’s throwing an error?
@Jim Rohrer
j
Not sure on that one...are you specifically using Flyte anywhere or is it some subdependency?
s
We use flyte workflow to train the model:
Copy code
import torch
import os
import typing
from flytekit import workflow
from project.wf.main import Hyperparameters
from project.wf.main import run_wf

_wf_outputs = typing.NamedTuple("WfOutputs", run_wf_0=torch.nn.modules.module.Module)

@workflow
def wf_40(_wf_args: Hyperparameters) -> _wf_outputs:
    run_wf_o0_ = run_wf(hp=_wf_args)
    return _wf_outputs(run_wf_o0_)
But that’s just a warning - not sure why it would throw an error? Strange - I’ve never run into this issue before
save_model.py
is just taking the already trained model that’s saved locally:
Copy code
import torch
import bentoml

with open("/userRepoData/taeefnajib/PyTorch-MNIST/sidetrek/models/7b12772142b14606286a26751d27b878.pt", "rb") as f:
    model = torch.load(f)
    saved_model = bentoml.pytorch.save_model("example_model", model)
Maybe I’m not really understanding how bentoml saves the model? Wouldn’t the above file just take the trained model and save it locally in
/bentoml
? Why does bentoml care what other files exist in this case?
Or since project files here are irrelevant (i.e. the model is already trained and saved in a local file), maybe I need to exclude all other project files in bentofile.yaml ?
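(Editor's note: bentofile.yaml does support `include` and `exclude` lists for controlling which project files get packed into the bento — though, as established later in the thread, that only affects `bentoml build`, not `save_model`. A hedged sketch, with paths guessed for illustration rather than taken from the real repo:)

```yaml
service: "service:svc"   # hypothetical service entry point
include:
  - "service.py"         # only pack what the service actually needs
exclude:
  - "project/"           # leave the training code out of the bento
python:
  packages:
    - torch
    - bentoml
```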
j
could be that flyte is getting included with the pytorch PT file?
s
Hmm that’s possible - but I was able to deploy bentoml using flyte-trained models before
Strange this is suddenly happening
And I think it’s unlikely the pt file would include flyte
j
yeah that's really weird. unfortunately i'm not real familiar with using Flyte
same, I can't see why it would get packaged up in there
s
Right - a quick question. Is bentofile.yaml used during save_model?
It shouldn’t be, no? It seems to be a build configuration?
j
correct, I don't think so. is flyte one of your dependencies in your bentofile?
s
No but it’s also failing during save_model, not during build
I’m having a hard time understanding why even the original error would occur during save_model, since I’m only supplying the function with the built
.pt
file. Why would bentoml try to read the original project files?
j
just looking at an sklearn bento model I created, it just copied the .pkl file over and created a model.yaml file. If you look in
~/bentoml/models
you should see a folder for your model there
if you look in that folder, you'll see all the versions it's created. Go into the latest version and you'll see the files it saved for the model, plus a model.yaml file
can you copy the model.yaml file here?
s
Copy code
name: example_model
version: 7rsqyxgysc6hk3uw
module: bentoml.pytorch
labels: {}
options:
  partial_kwargs: {}
metadata: {}
context:
  framework_name: torch
  framework_versions:
    torch: 2.0.0
  bentoml_version: 1.0.15
  python_version: 3.10.10
signatures:
  __call__:
    batchable: false
api_version: v1
creation_time: '2023-04-11T17:47:53.973669+00:00'
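(Editor's note: for the automation concern that comes up below — in BentoML 1.x, `save_model` returns a `bentoml.Model`, and `str(model.tag)` has the same `name:version` shape shown in this model.yaml, so a CI step can capture the version from the return value instead of parsing stdout. Assuming that tag format, a tiny helper:)

```python
# Split a bento model tag of the form "name:version" (the format shown in
# the model.yaml above) into its parts for use in a CI/CD pipeline.
def split_tag(tag: str) -> tuple:
    name, _, version = tag.partition(":")
    return name, version

print(split_tag("example_model:7rsqyxgysc6hk3uw"))
```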
Ohh wait - maybe it’s not really a bentoml error
I’m running this as a child process in node server as part of the automation
Maybe it just prints out the warning but it’s treated as stderr, which is considered an error in our server
Sorry I think it’s my mistake šŸ™‡ā€ā™‚ļø
j
No worries! Sometimes you just have to talk it out šŸ™‚
s
Thank you so much for walking me through this! Much appreciated. But just for future reference, do you know why the original error of
module "project" not found
error happened during save_model even though it’s only using the saved local
pt
file?
Does bentoml reference the original code during save_model for some reason?
For example, if I have no code - just local
pt
file, shouldn’t bentoml save_model still work?
j
nah, this is just Python's crappy module structuring lol...it is weird that save_model.py doesn't seem to reference anything else in your package structure, so I'm not sure why it would matter if it can find your
project
module
you might need an
__init__.py
file in your
services
directory? not 100% sure on that one
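(Editor's note: one plausible explanation for the original error — an assumption, not something confirmed in this thread: `torch.save` on a whole `nn.Module` pickles the object, and pickle records classes by import path. Unpickling the .pt file then has to re-import whatever module defined the model class, so if that class lived under `project.*`, even a standalone save_model script needs `project` importable. A plain-pickle reproduction with a hypothetical module name:)

```python
import importlib
import os
import pickle
import sys
import tempfile

# Define a class in a temporary module and pickle an instance of it...
root = tempfile.mkdtemp()
with open(os.path.join(root, "mymodel.py"), "w") as f:
    f.write("class Net:\n    pass\n")
sys.path.insert(0, root)
mymodel = importlib.import_module("mymodel")
blob = pickle.dumps(mymodel.Net())

# ...then simulate a fresh process where "mymodel" is no longer importable.
sys.path.remove(root)
del sys.modules["mymodel"]
try:
    pickle.loads(blob)
    result = "loaded"
except ModuleNotFoundError as e:
    # Unpickling tries to `import mymodel` again and fails.
    result = f"failed: {e.name}"
print(result)
```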
s
Ahh ok - yeah I’ve been fighting the python module import system for some time now so I know what that’s like lol
j
haha it's so bad
šŸ˜„ 1
s
Can’t believe how hard it is to just import something in python
Thanks again for your help - really appreciate this!
j
most welcome šŸ™‚
šŸ™ 1
s
@Jim Rohrer Sorry just one more quick question - does save_model have a REST API counterpart?
Also for bentoml cli commands including the build?
j
Hi @Seungchan Lee , is
bentoml models import
what you're asking for?
s
@Jiang No I mean if there’s a REST API endpoint I can hit to do the same thing, instead of running a python script or using the bentoml cli. It’s just harder to automate bentoml via CI/CD when you have to run them in a subprocess (much slower, since spawning a subprocess takes 2-4 seconds to start, and it's also harder to deal with responses, errors, etc). For example, I’m running a python script to use
save_model
as part of a CI/CD automation, but in order to get the resulting bento model version generated by
save_model
, I have to
print()
to stdout and parse that which is not great. Also a simple warning prints to
stderr
which makes things more brittle, as it’ll be treated as an error and the pipeline fails to proceed to the next step.
šŸ± 1
Please let me know if there’s a REST api for bentoml so I can replace the scripts and cli subprocesses
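(Editor's note: a small illustration of the brittleness described above — a child process can write warnings to stderr and still exit successfully, so an automation wrapper should branch on the exit code rather than on stderr being non-empty. A sketch using Python's subprocess, with a stand-in command rather than the real bentoml invocation:)

```python
import subprocess
import sys

# Child process prints a result to stdout and a warning to stderr, but exits 0.
proc = subprocess.run(
    [sys.executable, "-c",
     "import sys; print('model_tag=example_model:abc123'); "
     "print('FlyteSchema is deprecated...', file=sys.stderr)"],
    capture_output=True, text=True,
)
# Key off the exit code; non-empty stderr does not mean failure.
ok = proc.returncode == 0
print(ok, proc.stdout.strip(), bool(proc.stderr.strip()))
```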
j
Unless you’re running Yatai in Kubernetes, there’s no REST API equivalent for the bento cli commands, at least none that I’m aware of.
Theoretically you could build a REST wrapper to execute cli commands, but you’d ultimately be doing the same thing, calling sub processes.
šŸ‘ 1
a
Can you try adding importlib.import_module("project") to external_modules?
j
@Seungchan Lee Hi Seungchan.
save_model
is actually part of the SDK, not the command line. The difference here is that save_model is used to save a model object located in Python memory to a file, and therefore it must be executed with code at the end of the training pipeline. If we want to support the REST API as you mentioned, we first need a common convention for sending Python objects in memory via the REST API. However, as we all know, such a convention does not exist. If you have a private protocol, you can implement this REST server yourself, but it cannot be promoted to the community for everyone to use. I'm not sure if I fully understand your question. Can you provide a more specific scenario to help me understand better?
s
@Aaron Pham Jim Rohrer actually helped me with the original problem. Thanks for suggesting a solution though!
šŸ» 1
@Jiang Right that makes sense - I didn’t think through the
save_model
part. As for wanting a REST API for the cli commands - the motivation is easier automation.
Running the bentoml cli requires spawning subprocesses, which is harder to work with than a REST api. It’s slower, and it's harder both to work with responses (since stdout is just text that needs to be parsed) and to handle errors (stderr is really tricky to handle)
Also harder to grab model/bento data from our internal dashboard
j
Sure, that will be great. Kudos to @Aaron Pham