# ask-for-help
c
cc @larme (shenyang)
a
can you pass the tokenizer data as partial kwargs?
l
Hi Suhas, could you upgrade BentoML to the latest version with pip install -U bentoml? The latest BentoML shouldn't have this limitation.
s
I have version 1.0.13 and I still get this issue.
l
Hi Suhas, that part of the code should only be invoked when the model's input is described as a tensor type. Maybe there's a hidden bug. We would appreciate it if you could provide a minimal example that reproduces this behaviour. Thanks!
s
For example: you can convert a bert-base model to ONNX, create an ONNX runner with the CUDA provider, and load the tokenizer as below
Copy code
from transformers import BertTokenizer
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
tokenizer_data = tokenizer.tokenize("I have a new GPU!")
Then initialise the model locally and, for inference, use the ONNX runner: onnx_runner.run.run(tokenizer_data)
l
Could you try this?
Copy code
tokens = list(tokenizer("this is a sample", return_tensors="np").values())
runner.run.run(*tokens)
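For context: tokenizer.tokenize() only returns a list of token strings, while the ONNX model expects the numeric arrays (input_ids, token_type_ids, attention_mask) that calling the tokenizer with return_tensors="np" produces. Here is a rough end-to-end sketch, assuming the model was already exported to ONNX (e.g. with python -m transformers.onnx --model=bert-base-uncased onnx/); the model name "bert-base-uncased-onnx" is just a placeholder:
Copy code
import bentoml
import onnx
from transformers import BertTokenizer

# Save the exported ONNX model into the BentoML model store
onnx_model = onnx.load("onnx/model.onnx")
bentoml.onnx.save_model("bert-base-uncased-onnx", onnx_model)

# Build a runner; init_local() is only for local debugging.
# Inside a Service, BentoML manages the runner lifecycle for you.
runner = bentoml.onnx.get("bert-base-uncased-onnx:latest").to_runner()
runner.init_local()

# return_tensors="np" yields numpy arrays, which the runner
# passes to the ONNX session as positional inputs
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
tokens = list(tokenizer("this is a sample", return_tensors="np").values())
outputs = runner.run.run(*tokens)
I believe that with onnxruntime-gpu installed, the CUDA execution provider is picked up automatically when the runner is scheduled on a GPU.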
s
Thanks, it works!
l
Thanks for the feedback. We will add a BERT/transformers ONNX example to our docs.
s
Perfect, providing an example would help new developers. Thanks for looking into it.