# ask-for-help
c
cc @Jiang @Sean
j
@Yakir Saadia Could you please share the service definition here?
y
What do you mean by service definition?
j
The source code where you defined the bentoml Service
y
Do you mean this?
svc = bentoml.Service("AppManager", runners=[r1, r2, r3, r4])
Or the endpoint definition:
```python
@svc.api(input=File_IO(mime_type="image/jpeg"), output=JSON_IO())
async def predict_example4(input_file):
```
j
And also the place where you called the runner
Plus the full error log
y
This is where I called the runners:
```python
pred1, pred2 = await asyncio.gather(
    runner1.predict.async_run(img_tensors),
    runner2.predict.async_run(img_tensors)
)
```
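For reference, the fan-out pattern above can be exercised without BentoML. A minimal sketch in which the two runner calls are replaced by hypothetical stand-in coroutines (`fake_runner1`/`fake_runner2` are not real BentoML APIs, just simulations of `runner.predict.async_run`):

```python
import asyncio

# Hypothetical stand-ins for runner1.predict.async_run / runner2.predict.async_run;
# the real calls dispatch to BentoML runner processes, these only simulate latency.
async def fake_runner1(x):
    await asyncio.sleep(0.01)
    return f"r1:{x}"

async def fake_runner2(x):
    await asyncio.sleep(0.01)
    return f"r2:{x}"

async def predict(img_tensors):
    # Both calls are scheduled concurrently; gather awaits both and
    # returns the results in call order.
    pred1, pred2 = await asyncio.gather(
        fake_runner1(img_tensors),
        fake_runner2(img_tensors),
    )
    return pred1, pred2

print(asyncio.run(predict("batch")))  # ('r1:batch', 'r2:batch')
```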
And I don't have at the moment the full error log
@Jiang?
j
How did you serve it
bentoml serve ...
or
bentoml serve --production
?
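For context, the two serve modes differ in process layout. A sketch of both invocations, assuming the Service is defined as `svc` in `service.py` (the module path is an assumption):

```shell
# Development mode: single process with hot reload, runners run in-process
bentoml serve service:svc

# Production mode: separate api_server and runner worker processes
bentoml serve service:svc --production
```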
y
@Jiang with --production
I want to highlight that it happens only under a certain load (the server itself still has plenty of resources). That makes me think the runner behaves unexpectedly once a certain number of requests is being handled
j
Did you include runner1 in the
svc = Service(runners=...
y
Yes. All runners are included in the service definition and my runners are usually working. As I highlighted, it happens only at a certain load
j
I believe I need the full log here
https://github.com/bentoml/BentoML/issues/2271 Since you are using async endpoints (which is recommended, of course), I believe the situation is different from that issue
y
Because of this post, I should mention that I have aiohttp version 3.8.1
@Jiang I have also encountered this exception. Can you advise?
j
This is basically saying that the connection was already closed for some reason
Did you see any exception from the runner component?
(the `[api_server]` at the start of the log line basically means this exception happened in the api_server component)
y
I didn't see an exception from the runner. What I have shared in the image is the only exception seen
j
I will try to reproduce this issue. Would you be comfortable trying to do the same test in a docker environment?
y
Can't at the moment. In the meantime I will try to get you the full log for the main issue in this thread
@Jiang Here is the traceback to the exception (the main issue in this thread)
@Jiang I think I solved it. From the traceback I was able to figure out that the cause of this problem was that I tried running the inference with `async_run` from an async endpoint
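Editor's note: the sync-vs-async distinction at the heart of this thread can be illustrated without BentoML. A minimal sketch (handler names are hypothetical stand-ins, not BentoML APIs) of why a blocking call inside an async endpoint stalls the event loop, whereas an awaited call lets requests overlap:

```python
import asyncio
import time

async def blocking_handler():
    # Stand-in for a synchronous call (e.g. runner.run()) made inside an
    # async endpoint: it blocks the event loop for its whole duration.
    time.sleep(0.1)

async def awaiting_handler():
    # Stand-in for an awaited call (e.g. runner.async_run()): the event
    # loop stays free to serve other requests while this one waits.
    await asyncio.sleep(0.1)

async def serve_five(handler):
    # Time five "requests" issued concurrently against the same handler.
    start = time.perf_counter()
    await asyncio.gather(*(handler() for _ in range(5)))
    return time.perf_counter() - start

blocking = asyncio.run(serve_five(blocking_handler))
overlapping = asyncio.run(serve_five(awaiting_handler))
# The blocking variant takes roughly 5x longer: its calls are serialized
# on the event loop, while the awaited ones run concurrently.
print(f"blocking: {blocking:.2f}s, overlapping: {overlapping:.2f}s")
```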